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FIGURE 1 

GTTACTGCACCCCAAAACAGGTCTGGCCACGGCCATGAGCATGCTGAGCCATCATGCCCACCGTGGATGACATTCTG 
GAGCAGGTTGGGGAGTCTGGCTGGTTCCAGAAGCAAGCCTTCCTCATCTTATGCCTGCTGTCGGCTGCCTTTGCGCC 
CATCTGTGTGGGCATCGTCTTCCTGGGTTTCACACCTGACCACCACTGCCAGAGTCCTGGGGTGGCTGAGCTGAGCC 
AGCGCTGTGGCTGGAGCCCTGCGGAGGAGCTGAACTATACAGTGCCAGGCCTGGGGCCCGCGGGCGAGGCCTTCCTT 
GGCCAGTGCAGGCGCTATGAAGTGGACTGGAACCAGAGCGCCCTCAGCTGTGTAGACCCCCTGGCTAGCCTGGCCAC 
CAACAGGAGCCACCTGCCGCTGGGTCCCTGCCAGGATGGCTGGGTGTATGACACGCCCGGCTCTTCCATCGTCACTG 
AGTTCAACCTGGTGTGTGCTGACTCCTGGAAGCTGGACCTCTTTCAGTCCTGTTTGAATGCGGGCTTCTTGTTTGGC 
TCTCTCGGTGTTGGCTACTTTGCAGACAGGTTTGGCCGTAAGCTGTGTCTCCTGGGAACTGTGCTGGTCAACGCGGT 
GTCGGGCGTGCTCATGGCCTTCTCGCCCAACTACATGTCCATGCTGCTCTTCCGCCTGCTGCAGGGCCTGGTCAGCA 
AGGGCAACTGGATGGCTGGCTACACCCTAATCACAGAATTTGTTGGCTCGGGCTCCAGAAGAACGGTGGCGATCATG 
TACCAGATGGCCTTCACGGTGGGGCTGGTGGCGCTTACCGGGCTGGCCTACGCCCTGCCTCACTGGCGCTGGCTGCA 
GCTGGCAGTCTCCCTGCCCACCTTCCTCTTCCTGCTCTACTACTGGTGTGTGCCGGAGTCCCCTCGGTGGCTGTTAT 
CACAAAAAAGAAACACTGAAGCAATAAAGATAATGGACCACATCGCTCAAAAGAATGGGAAGTTGCCTCCTGCTGAT 
TTAAAGATGCTTTCCCTCGAAGAGGATGTCACCGAAAAGCTGAGCCCTTCATTTGCAGACCTGTTCCGCACGCCGCG 
CCTGAGGAAGCGCACCTTCATCCTGATGTACCTGTGGTTCACGGACTCTGTGCTCTATCAGGGGCTCATCCTGCACA 
TGGGCGCCACCAGCGGGAACCTCTACCTGGATTTCCTTTACTCCGCTCTGGTCGAAATCCCGGGGGCCTTCATAGCC 
CTCATCACCATTGACCGCGTGGGCCGCATCTACCCCATGGCCATGTCAAATTTGTTGGCGGGGGCAGCCTGCCTCGT 
CATGATTTTTATCTCACCTGACCTGCACTGGTTAAACATCATAATCATGTGTGTTGGCCGAATGGGAATCACCATTG 
CAATACAAATGATCTGCCTGGTGAATGCTGAGCTGTACCCCACATTCGTCAGGAACCTCGGAGTGATGGTGTGTTCC 
TCCCTGTGTGACATAGGTGGGATAATCACCCCCTTCATAGTCTTCAGGCTGAGGGAGGTCTGGCAAGCCTTGCCCCT 
CATTTTGTTTGCGGTGTTGGGCCTGCTTGCCGCGGGAGTGACGCTACTTCTTCCAGAGACCAAGGGGGTCGCTTTGC 
CAGAGACCATGAAGGACGCCGAGAACCTTGGGAGAAAAGCAAAGCCCAAAGAAAACACGATTTACCTTAAGGTCCAA 
ACCTCAGAACCCTCGGGCACC TGA GAGAGATGTTTTGCGGCGATGTCGTGTTGGAGGGATGAAGATGGAGTTATCCT 
CTGCAGAAATTCCTAGACGCCTTCACTTCTCTGTATTCTTCCTCATACTTGCCTACCCCCAAATTAATATCAGTCCT 
AAAGAAAAAAAAAAAAAAAAAA 
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FIGURE 2 

AATTAGCATAACCCTTCCTCAGGAAGAGTGAGATTTTATATTTGACAATAAAGTGTTAGACTCCATTTCTAAATACC 
AGACTTCAAAAGATAAGGTTCAAAAGTGTTATAAGAAGATATTCCTTTTTTTGTCCTAGAGAACTTATTTTCCTGTG 
AAAATGCCTACCACAAAGAAGACATTGATGTTCTTATCAAGCTTTTTCACCAGCCTTGGGTCCTTCATTGTAATTTT 
GCTCTATTCTTGGGACACAAGCATGGATCACCAGTACAATTGCTGTTAGAGACTCTGCTTCAAATGGGAGCATTTTC 
ATCACTTACGGACTTTTTCGTGGGGAGAGTAGTGAAGAATTGAGTCACGGACTTGCAGAACCAAAGAAAAAGTTTGC 
AGTTTTAGAGATACTGAATAATTCTTCCCAAAAAACTCTGCATTCGGTGACTATCCTGTTCCTGGTCCTGAGTTTGA 
TCACGTCGCTGCTGAGCTCTGGGTTTACCTTCTACAAACAGCATCAGCAACCCTTACCAGACATTCCTGGGGCCCGA 
CGGGGGTGTACACCTGGGAACGGGCTTCGGNGCATNCTTCGTTTTTGGTGACCATGATACTGGTTTGNGGGNAACAC 
GCAGGTCCCACCCAACTCTTCCGAANAAGTTGTTTCCAAAATGCTTTACCCCCGGNAACCANCCAGTAAAGGAACGA 
CCCCACAGGTACCGGATACTCGTGCTGGCTCATACTGG 
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FIGURE 3 

GATCAGTGTGTGAGGGAACTGCCATCATCAGGTCTGACAAGTCAGCTTTGGTATTTCTGCTCCTGCAGCTCTTCTGT 
GTTGGCTGTGGATTCTGTGGGAAAGTCCTGGTGTGGCCCTGTGACATGAGCCATTGGCTTAATGTCAAGGTCATTCT 
AGAAGAGCTCATAGTGAGAGGCCATGAGGTAACAGTATTGACTCACTCAAAGCCTTCGTTAATTGACTACAGGAAGC 
CTTCTGCATTGAAATTTGAGGTGGTCCATATGCCACAGGACAGAACAGAAGAAAATGAAATATTTGTTGACCTAGCT 
CTGAATGTCTTGCCAGGCTTATCAACCTGGCAATCAGTTATAAAATTAAATGATTTTTTTGTTGAAATAAGAGGAAC 
TTTAAAAATGATGTGTGAGAGCTTTATCTACAATCAGACGCTTATGAAGAAGCTACAGGAAACCAACTACGATGTAA 
TGCTTATAGACCCTGTGATTCCCTGTGGAGACCTGATGGCTGAGTTGCTTGCAGTCCCTTTTGTGCTCACACTTAGA 
ATTTCTGTAGGAGGCAATATGGAGCGAAGCTGTGGGAAACTTCCAGCTCCACTTTCCTATGTACCTGTGCCTATGAC 
AGGACTAACAGACAGAATGACCTTTCTGGAAAGAGTAAAAAATTCAATGCTTTCAGTTTTGTTCCACTTCTGGATTC 
AGGATTACGACTATCATTTTTGGGAAGAGTTTTATAGTAAGGCATTAGGAAGGCCCACTACATTATGTGAGACTGTG 
GGAAAAGCTGAGATATGGCTAATACGAACATATTGGGATTTTGAATTTCCTCAACCATACCAACCTAACTTTGAGTT 
TGTTGGAGGATTGCACTGTAAACCTGCCAAAGCTTTGCCTAAGGAAATGGAAAATTTTGTCCAGAGTTCAGGGGAAG 
ATGGTATTGTGGTGTTTTCTCTGGGGTCACTGTTTCAAAATGTTACAGAAGAAAAGGCTAATATCATTGCTTCAGCC 
CTTGCCCAGATCCCACAGAAGGTGTTATGGAGGTACAAAGGAAAAAAACCATCCACATTAGGAGCCAATACTCGGCT 
GTATGATTGGATACCCCAGAATGATCTTCTTGGTCATCCCAAAACCAAAGCTTTTATCACTCATGGTGGAATGAATG 
GGATCTATGAAGCTATTTACCATGGGGTCCCTATGGTGGGAGTTCCCATATTTGGTGATCAGCTTGATAACATAGCT 
CACATGAAGGCCAAAGGAGCAGCTGTAGAAATAAACTTCAAAACTATGACAAGCGAAGATTTACTGAGGGCTTTGAG 
AACAGTCATTACCGATTCCTCTTATAAAGAGAATGCTATGAGATTATCAAGAATTCACCATGATCAACCTGTAAAGC 
CCCTAGATCGAGCAGTCTTCTGGATCGAGTTTGTCATGCGCCACAAAGGAGCCAAGCACCTGCGATCAGCTGCCCAT 
GACCTCACCTGGTTCCAGCACTACTCTATAGATGTGATTGGGTTCCTGCTGACCTGTGTGGCAACTGCTATATTCTT 
GTTCACAAAATGTTTTTTATTTTCCTGTCAAAAATTTAATAAAACTAGAAAGATAGAAAAGAGGGA ATAG ATCTTTC 
CAAATTCAAGAAAGACCTGATGGGGTAATCCTGTTAATTCCAGCCACATAGAATTTGGTGAAAACCTTGCTATTTTC 
ATATTATCTATTCTGTTATTTTATCTTAGCTATATAGCCTAGAATTCCATGATCATGAGGTTGTGAGTATATCTCAT 
TCTTTCGTTGTATTTTCCTAGGTGTCTTTACTCTCTTCTCTCACTTTGTGACACAAGGACATGAATACATCTAAATT 
TTCCTATTTCTGATATGACTGTTTTGATGATGTCATTACTTCTATAACCTTAAGTGATAGGGTGACATGCAATATGA 
TTATTCCTGGTGTGCGCCCAAACACATGGATATAAAGAGGTAAAAAACTTAAAATTCACAAAATTCAGTAAACCACA 
CAAATCAGGTAAGTGTTCTATGAGATTAGCTGGCTATGAGAAACATAATGATGTTTCTTTTTCAATTTAAATAAGCC 
TTTCTACATAGCCAGCATCAGTGATCTCAGAAAATAAATTGCTAATAATGATGACATGGCATTATGCTTAGAAAAGT 
TTGCTGTATTTCCATAGACCTCATCTAGATGTCATGGCCTACATTTCTGCCATCACTCAACCAATACTTTTTTCTGT 
TTTCTTGATGATAAAAAGACCTTTCTCATGATTGCCATCAAATAACAAAAGAAACTATTTTTTTTCTCACATAGAGA 
ACATGTCAGTAAGATATTCAAGGTGAACAGATATTTTTGGGATTAGTAACTATTTGAAATATGTGGTGATAATTACT ' 
GAGTTTATAAAATTTATTTGATAGTACACTTAAAGAAGATTTATATGTTTATTCTTTAAAAATGATGAATACTCATA 
ATTCTTATCTCTATAATCAAAAGTATAATTTACTGTAGAAAAATAAAGAGATGCTTGTTCTGAAAGTAAGATCAGTG 
AACTGCTTTTCAGTCTCAATCTTTGAGAATTGTAAATTCATCAAATAATTGCTTACATAGTAAAAATTTAAGGTATT 
AGAAAACCTGCATAACAAATAGTATTATATATTAAATATTTTGATATGTAAAGCTCTACACAAAGCTAAATATAGTG 
TAATAATGTTTACACTAGTAAGCAAATATGTTAATCTTCTCATTTTTTTACTGTCATATAATCTTAGTGATATGCCT 
ATTAATAGTTTTAAATAAATAAATTGGCTTATCTGGCTTTTTGAAAATTTTGAAATTCTTACAGATGTTGATTAGGT 
ATATCTACAAATTAATTTCAATTTTAAAATGATGATATAAAAATAAATATAAGTATTTTTCTTGTGTATGTATACAA 
TAAATATAAATAAAATTGTTTACTGTTTTGAAAGTTTCTTAAGTTTTA 
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FIGURE 4 

GAGGATGCTGCTGGGGAAGATGTGCGCGCCGAGGTACCCGTCCGGGGGCGCCTGGGACCCTCACCCAGGCCAGGCTT 
CGGGGAGAGCATGTATCTGCTCTCGGACAAGGCCACCTCGCCGCTCTCGCTGGATGCTGGCCTCGGGCAGGCCCCCT 
GGAGCGACCTGCTTCTTTGGGCACTGTTGCTGAACAGGGCACAGATGGCCATGTACTTCTGGGAGATGGGTTCCAAT 
GCAGTTTCCTCAGCTCTTGGGGCCTGTTTGCTGCTCCGGGTGATGGCACGCCTGGAGCCTGACGCTGAGGAGGCAGC 
ACGGAGGAAAGACCTGGCGTTCAAGTTTGAGGGGATGGGCGTTGACCTCTTTGGCGAGTGCTATCGCAGCAGTGAGG 
TGAGGGCTGCCCGCCTCCTCCTCCGTCGCTGCCCGCTCTGGGGGGGATGCCACTTGCCTCCAGCTGGCCATGCAAGC 
TGACGCCCGTGCCTTCTTTGCCCAGGATGGGGTACAGTCTCTGCTGACACAGAAGTGGTGGGGAGATATGGCCAGCA 
CTACACCCATCTGGGCCCTGGTTCTCGCCTTCTTTTGCCCTCCACTCATCTACACCCGCCTCATCACCTTCAGGAAA 
TCAGAAGAGGAGCCCACACGGGAGGAGCTAGAGTTTGACATGGATAGTGTCATTAATGGGGAAGGGCCTGTCGGGAC 
GGCGGACCCAGCCGAGAAGACGCCGCTGGGGGTCCCGCGCCAGTCGGGCCGTCCGGGTTGCTGCGGGGGCCGCTGCG 
GGGGGCGCCGGTGCCTACGCCGCTGGTTCCACTTCTGGGGCGCGCCGGTGACCATCTTCATGGGCAACGTGGTCAGC 
TACCTGCTGTTCCTGCTGCTTTTCTCGCGGGTGCTGCTCGTGGATTTCCAGCCGGCGCCGCCCGGCTCCCTGGAGCT 
GCTGCTCTATTTCTGGGCTTTCACGCTGCTGTGCGAGGAACTGCGCCAGGGCCTGAGCGGAGGCGGGGGCAGCCTCG 
CCAGCGGGGGCCCCGGGCCTGGCCATGCCTCACTGAGCCAGCGCCTGCGCCTCTACCTCGCCGACAGCTGGAACCAG 
TGCGACCTAGTGGCTCTCACCTGCTTCCTCCTGGGCGTGGGCTGCCGGCTGACCCCGGGTTTGTACCACCTGGGCCG 
CACTGTCCTCTGCATCGACTTCATGGTTTTCACGGTGCGGCTGCTTCACATCTTCACGGTCAACAAACAGCTGGGGC 
CCAAGATCGTCATCGTGAGCAAGATGATGAAGGACGTGTTCTTCTTCCTCTTCTTCCTCGGCGTGTGGCTGGTAGCC 
TATGGCGTGGCCACGGAGGGGCTCCTGAGGCCACGGGACAGTGACTTCCCAAGTATCCTGCGCCGCGTCTTCTACCG 
TCCCTACCTGCAGATCTTCGGGCAGATTCCCCAGGAGGACATGGACGTGGCCCTCATGGAGCACAGCAACTGCTCGT 
CGGAGCCCGGCTTCTGGGCACACCCTCCTGGGGCCCAGGCGGGCACCTGCGTCTCCCAGTATGCCAACTGGCTGGTG 
GTGCTGCTCCTCGTCATCTTCCTGCTCGTGGCCAACATCCTGCTGGTCAACTTGCTCATTGCCATGTTCAGTTACAC 
ATTCGGCAAAGTACAGGGCAACAGCGATCTCTACTGGAAGGCGCAGGTTACCGCCTCATCCGGGAATTCCACTCTCG 
GCCCGCGCTGGCCCCGCCCTTTATCGTCATCTCCCACTTGCGCCTCCTGCTCAGGCAATTGTGCAGGCGACCCCGGA 
GCCCCCAGCCGTCCTCCCCGGCCCTCGAGCATTTCCGGGTTTACCTTTCTAAGGAAGCCGAGCGGAAGTGCTAACGT 
GGGAATCGGTGCATAAGGAGAACTTTCTGCTGGCACGCGCTAGGGACAAGCGGGAGAGCGACTCCGAGCGTCTGAAG 
CGCACGTCCCAGAAGGTGGACTTGGCACTGAAACAGCTGGGACACATCCGCGAGTACGAACAGCGCCTGAAAGTGCT 
GGAGCGGGAGGTCCAGCAGTGTAGCCGCGTCCTGGGGTGGGTGGCCGAGGCCCTGAGCCGCTCTGCCTTGCTGCCCC 
CAGGTGGGCCGCCACCCCCTGACCTGCCTGGGTCCAAAGACTGAGCCCTGCTGGCGGACTTCAAGGAGAAGCCCCCA 
CAGGGGATTTTGCTCCTAGAGTAAGGCTCATCTGGGCCTCGGCCCCCGCACCTGGTGGCCTTGTCCTTGAGGTGAGC 
CCCATGTCCATCTGGGCCACTGTCAGGACCACCTTTGGGAGTGTCATCCTTACAAACCACAGCATGCCCGGCTCCTC 
CCAGAACCAGTCCCAGCCTGGGAGGATCAAGGCCTGGATCCCGGGCCGTTATCCATCTGGAGGCTGCAGGGTCCTTG 
GGGTAACAGGGACCACAGACCCCTCACCACTCACAGATTCCTCACACTGGGGAAATAAAGCCATTTCAGAGG 
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FIGURE 5 

TNCCCGCAGAACCAGGAAAGTAACGGCTACAGACAGTGAGAAATAGTTTCGCTCGCCGGCTAGAAAAACTCTGTCGG 
TACCAACCCCAGAGCGTTGAGAGCAGCCCACCTCCACGCTTCCTTAACGGAGAGGTGCAGGACTCAGACTTCACCAG 
CCCACTCGGTCCCAGCCTTGTACGCAAAGAGACGCCAAGGACGCGCTCTCCCGCGTCCAGGCAGCCCCAGCTTGCTG 
GCTTGCCTGCCCGCCTGCGTGCAGCACTCGGCCGGCGTGCAGCATGACCCTGTGGAACGGCGTACTGCCTTTTTACC 
CCCAGCCCCGGCATGCCGCAGGCTTCAGCGTTCCACTGCTCATCGTTATTCTAGTGTTTTTGGCTCTAGCAGCAAGC 
TTCCTGCTCATCTTGCCGGGGATCCGTGGCCACTCGCGCTGGTTTTGGTTGGTGAGAGTTCTTCTCAGTCTGTTCAT 
AGGCGCAGAAATTGTGGCTGTGCACTTCAGTGCAGAATGGTTCGTGGGTACAGTGAACACCAACACATCCTACAAAG 
CCTTCAGCGCAGCGCGCGTTACAGCCCGTGTCGGTCTGCTCGTGGGCCTGGAGGGCATTAATATTACACTCACAGGG 
ACCCCAGTGCATCAGCTGAACGAGACCATTGACTACAACGAGCAGTTCACCTGGCGTCTGAAAGAGAATTACGCCGC 
GGAGTACGCGAACGCACTGGAGAAGGGGCTGCCGGACCCAGTGCTCTACCTGGCGGAGAAGTTCACACCGAGTAGCC 
CTTGCGGCCTGTACCACCAGTACCACCTGGCGGGACACTACGCCTCGGCCACGCTATGGGTGGCGTTCTGCTTCTGG 
CTCCTCTCCAACGTGCTGCTCTCCACGCCGGCCCCGCTCTACGGAGGCCTGGCACTGCTGACCACCGGAGCCTTCGC 
GCTCTTCGGGGTCTTCGCCTTGGCCTCCATCTCTAGCGTGCCGCTCTGCCCGCTCCGCCTAGGCTCCTCCGCGCTCA 
CCACTCAGTACGGCGCCGCCTTCTGGGTCACGCTGGCAACCGGCGTCCTGTGCCTCTTCCTCGGAGGGGCCGTGGTG 
AGTCTCCAGTATGTTCGGCCCAGCGCTCTTCGCACCCTTCTGGACCAAAGCGCCAAGGACTGCAGCCAGGAGAGAGG 
GGGCTCACCTCTTATCCTCGGCGACCCACTGCACAAGCAGGCCGCTCTCCCAGACTTAAAATGTATCACCACTAACC 
TGTGAGGGGGACCCAATCTGGACTCCTTCCCCGCCTTGGGACATCGCAGGCCGGGAAGCAGTGCCCGCCAGGCCTGG 
GCCAGGAGAGCTCCAGGAAGGGCACTGAGCGCTGCTGGCGCGAGGCCTCGGACATCCGCAGGCACCAGGGAAAGTCT 
CCTGGGGCGATCTGTAAATAAACCTTTTTTTCTTTTGTTTTTTAAAAAAAAATAAAAGTCGACC 
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FIGURE 6 

CTGCAGGCTTCAGGAGGGGACACAAGCATGGAGCGGCTTTGGGGTCTATTCCAGAGAGCGCAACAACTGTCCCCAAG 
ATCCTCTCAGACCGTCTACCAGCGTGTGGAAGGCCCCCGGAAAGGGCACCTGGAGGAGGAAGAGGAAGACGGGGAGG 
AGGGGGCGGAGACATTGGCCCACTTCTGCCCCATGGAGCTGAGGGGCCCTGAGCCCCTGGGCTCTAGACCCAGGCAG 
CCAAACCTCATTCCCTGGGCGGCAGCAGGACGGAGGGCTGCCCCCTACCTGGTCCTGACGGCCCTGCTGATCTTCAC 
TGGGGCCTTCCTACTGGGCTACGTCGCCTTCCGAGGGTCCTGCCAGGCGTGCGGAGACTCTGTGTTGGTGGTCAGTG 
AGGATGTCAACTATGAGCCTGACCTGGATTTCCACCAGGGCAGACTCTACTGGAGCGACCTCCAGGCCATGTTCCTG 
CAGTTCCTGGGGGAGGGGCGCCTGGAGGACACCATCAGGCAAACCAGCCTTCGGGAACGGGTGGCAGGCTCGGCCGG 
GATGGCCGCTCTGACTCAGGACATTCGCGCGGCGCTCTCCCGCCAGAAGCTGGACCACGTGTGGACCGACACGCACT 
ACGTGGGGCTGCAATTCCCGGATCCGGCTCACCCCAACACCCTGCACTGGGTCGATGAGGCCGGGAAGGTCGGAGAG 
CAGCTGCCGCTGGAGGACCCTGACGTCTACTGCCCCTACAGCGCCATCGGCAACGTCACGGGAGAGCTGGTGTACGC 
CCACTACGGGCGGCCCGAAGACCTGCAGGACCTGCGGGCCAGGGGCGTGGATCCAGTGGGCCGCCTGCTGCTGGTGC 
GCGTGGGGGTGATCAGCTTCGCCCAGAAGGTGACCAATGCTCAGGACTTCGGGGCTCAAGGAGTGCTCATATACCCA 
GAGCCAGCGGACTTCTCCCAGGACCCACCCAAGCCAAGCCTGTCCAGCCAGCAGGCAGTGTATGGACATGTGCACCT 
GGGAACTGGAGACCCCTACACACCTGGCTTCCCTTCCTTCAATCAAACCCAGTTCCCTCCAGTTGCATCATCAGGCC 
TTCCCAGCATCCCAGCCCAGCCCATCAGTGCAGACATTGCCTCCCGCCTGCTGAGGAAGCTCAAAGGCCCTGTGGCC 
CCCCAAGAATGGCAGGGGAGCCTCCTAGGCTCCCCTTATCACCTGGGCCCCGGGCCACGACTGCGGCTAGTGGTCAA 
CAATCACAGGACCTCCACCCCCATCAACAACATCTTCGGCTGCATCGAAGGCCGCTCAGAGCCAGATCACTACGTTG 
TCATCGGGGCCCAGAGGGATGCATGGGGCCCAGGAGCAGCTAAATCCGCTGTGGGGACGGCTATACTCCTGGAGCTG 
GTGCGGACCTTTTCCTCCATGGTGAGCAACGGCTTCCGGCCCCGCAGAAGTCTCCTCTTCATCAGCTGGGACGGTGG 
TGACTTTGGAAGCGTGGGCTCCACGGAGTGGCTAGAAGGCTACCTCAGCGTGCTGCACCTCAAAGCCGTAGTGTACG 
TGAGCCTGGACAACGCAGTGCTGGGGGATGACAAGTTTCATGCCAAGACCAGCCCCCTTCTGACAAGTCTCATTGAG 
AGTGTCCTGAAGCAGGTGGATTCTCCCAACCACAGTGGGCAGACTCTCTATGAACAGGTGGTGTTCACCAATCCCAG 
CTGGGATGCTGAGGTGATCCGGCCCCTACCCATGGACAGCAGTGCCTATTCCTTCACGGCCTTTGTGGGAGTCCCTG 
CCGTCGAGTTCTCCTTTATGGAGGACGACCAGGCCTACCCATTCCTGCACACAAAGGAGGACACTTATGAGAACCTG 
CATAAGGTGCTGCAAGGCCGCCTGCCCGCCGTGGCCCAGGCCGTGGCCCAGCTCGCAGGGCAGCTCCTCATCCGGCT 
CAGCCACGATCGCCTGCTGCCCCTCGACTTCGGCCGCTACGGGGACGTCGTCCTCAGGCACATCGGGAACCTCAACG 
AGTTCTCTGGGGACCTCAAGGCCCGCGGGCTGACCCTGCAGTGGGTGTACTCGGCGCGGGGGGACTACATCCGGGCG 
GCGGAAAAGCTGCGGCAGGAGATCTACAGCTCGGAGGAGAGAGACGAGCGACTGACACGCATGTACAACGTGCGCAT 
AATGCGGGTGGAGTTCTACTTCCTTTCCCAGTACGTGTCGCCAGCCGACTCCCCGTTCCGCCACATCTTCATGGGCC 
GTGGAGACCACACGCTGGGCGCCCTGCTGGACCACCTGCGGCTGCTGCGCTCCAACAGCTCCGGGACCCCCGGGGCC 
ACCTCCTCCACTGGCTTCCAGGAGAGCCGTTTCCGGCGTCAGCTAGCCCTGCTCACCTGGACGCTGCAAGGGGCAGC 
CAATGCGCTTAGCGGGGATGTCTGGAACATTGATAACAACTTC TGA GGCCCTGGGGATCCTCACATCCCCGTCCCCC 
AGTCAAGAGCTCCTCTGCTCCTCGCTTGAATGATTCAGGGTCAGGGAGGTGGCTCAGAGTCCACCTCTCATTGCTGA 
TCAATTTCTCATTACCCCTACACATCTCTCCACGGAGCCCAGACCCCAGCACAGATATCCACACACCCCAGCCCTGC 
AGTGTAGCTGACCCTAATGTGACGGTCATACTGTCGGTTAATCAGAGAGTAGCATCCCTTCAATCACAGCCCCTTCC 
CCTTTCTGGGGTCCTCCATACCTAGAGACCACTCTGGGAGGTTTGCTAGGCCCTGGGACCTGGCCAGCTCTGTTAGT 
GGGAGAGATCGCTGGCACCATAGCCTTATGGCCAACAGGTGGTCTGTGGTGAAAGGGGCGTGGAGTTTCAATATCAA 
T AAAC C ACC T GAT AT C AAT AAG C C AAAA 
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FIGURE 7 

GCTGGAGCATCCCGCTCTGGTGCCGCTGCAGCCGGCAGAGATGGTTGAGCTCATGTTCCCGCTGTTGCTCCTCCTTC 
TGCCCTTCCTTCTGTATATGGCTGCGCCCCAAATCAGGAAAATGCTGTCCAGTGGGGTGTGTACATCAACTGTTCAG 
CTTCCTGGGAAAGTAGTTGTGGTCACAGGAGCTAATACAGGTATCGGGAAGGAGACAGCCAAAGAGCTGGCTCAGAG 
AGGAGCTCGAGTATATTTAGCTTGCCGGGATGTGGAAAAGGGGGAATTGGTGGCCAAAGAGATCCAGACCACGACAG 
GGAACCAGCAGGTGTTGGTGCGGAAACTGGACCTGTCTGATACTAAGTCTATTCGAGCTTTTGCTAAGGGCTTCTTA 
GCTGAGGAAAAGCACCTCCACGTTTTGATCAACAATGCAGGAGTGATGATGTGTCCGTACTCGAAGACAGCAGATGG 
CTTTGAGATGCACATAGGAGTCAACCACTTGGGTCACTTCCTCCTAACCCATCTGCTGCTAGAGAAACTAAAGGAAT 
CAGCCCCATCAAGGATAGTAAATGTGTCTTCCCTCGCACATCACCTGGGAAGGATCCACTTCCATAACCTGCAGGGC 
GAGAAATTCTACAATGCAGGCCTGGCCTACTGTCACAGCAAGCTAGCCAACATCCTCTTCACCCAGGAACTGGCCCG 
GAGACTAAAAGGCTCTGGCGTTACGACGTATTCTGTACACCCTGGCACAGTCCAATCTGAACTGGTTCGGCACTCAT 
CTTTCATGAGATGGATGTGGTGGCTTTTCTCCTTTTTCATCAAGACTCCTCAGCAGGGAGCCCAGACCAGCCTGCAC 
TGTGCCTTAACAGAAGGTCTTGAGATTCTAAGTGGGAATCATTTCAGTGACTGTCATGTGGCATGGGTCTCTGCCCA 
AGCTCGTAATGAGACTATAGCAAGGCGGCTGTGGGACGTCAGTTGTGACCTGCTGGGCCTCCCAATAGAC TAA CAGG 
CAGTGCAGTTGGACCCAAGAGAAGACTGCAGCAGACTACACAGTACTTCTTGTCAAAATGATTCTCCTTCAAGGTTT 
TCAAAACCTTTAGCACAAAGAGAGCAAAACCTTCCAGCCTTGCCTGCTTGGTGTCCAGTTAAAACTCAGTGTACTGC 
CAGATTCGTCTAAATGTCTGTCATGTCCAGATTTACTTTGCTTCTGTTACTGCCAGAGTTACTAGAGATATCATAAT 
AGGATAAGAAGACCCTCATATGACCTGCACAGCTCATTTTCCTTCTGAAAGAAACTACTACCTAGGAGAATCTAAGC 
TATAGCAGGGATGATTTATGCAAATTTGAACTAGCTTCTTTGTTCACAATTCAGTTCCTCCCAACCAACCAGTCTTC 
ACTTCAAGAGGGCCACACTGCAACCTCAGCTTAACATGAATAACAAAGACTGGCTCAGGAGCAGGGCTTGCCCAGGC 
ATGGTGGATCACCGGAGTCAGTAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCCCACCTCTACTAAAAATTGTG 
TATATCTTTGTGTGTCTTCCTGTTTATGTGTGCCAAGGGAGTATTTTCACAAAGTTCAAAACAGCCACAATAATCAG 
AGATGGAGCAAACCAGTGCCATCCAGTCTTTATGCAAATGAAATGCTGCAAAGGGAAGCAGATTCTGTATATGTTGG 
TAACTACCCACCAAGAGCACATGGGTAGCAGGGAAGAAGTAAAAAAAGAGAAGGAGAATACTGGAAGATAATGCACA 
AAATGAAGGGACTAGTTAAGGATTAACTAGCCCTTTAAGGATTAACTAGTTAAGGATTAATAGCAAAAGACATTAAA 
TATGCTAACATAGCTATGGAGGAATTGAGGGCAAGCACCCAGGACTGATGAGGTCTTAACAAAAACCAGTGTGGCAAA 
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FIGURE 8 

GCTCGGCCAGGCTGCTGGTACCTGCGTCCGCCCGGCGAGCAGGACAGGCTGCTTTGGTTTGTGACCTCCAGGCAGGA 
CGGCCATCCTCTCCAGA ATG AAGATCTTCTTGCCAGTGCTGCTGGCTGCCCTTCTGGGTGTGGAGCGAGCCAGCTCG 
CTGATGTGCTTCTCCTGCTTGAACCAGAAGAGCAATCTGTACTGCCTGAAGCCGACCATCTGCTCCGACCAGGACAA 
CTACTGCGTGACTGTGTCTGCTAGTGCCGGCATTGGGAATCTCGTGACATTTGGCCACAGCCTGAGCAAGACCTGTT 
CCCCGGCCTGCCCCATCCCAGAAGGCGTCAATGTTGGTGTGGCTTCCATGGGCATCAGCTGCTGCCAGAGCTTTCTG 
TGCAATTTCAGTGCGGCCGATGGCGGGCTGCGGGCAAGCGTCACCCTGCTGGGTGCCGGGCTGCTGCTGAGCCTGCT 
GCCGGCCCTGCTGCGGTTTGGCCCCTGACCGCCCAGACCCTGTCCCCCGATCCCCCAGCTCAGGAAGGAAAGCCCAG 
CCCTTTCTGGATCCCACAGTGTATGGGAGCCCCTGACTCCTCACGTGCCTGATCTGTGCCCTTGGTCCCAGGTCAGG 
CCCACCCCCTGCACCTCCACCTGCCCCAGCCCCTGCCTCTGCCCCAAGTGGGGCCAGCTGCCCTCACTTCTGGGGTG 
GATGATGTGACCTTCCTTGGGGGACTGCGGAAGGGACGAGGGTTCCCTGGAGTCTTACGGTCCAACATCAGGACCAA 
GTCCCATGGACATGCTGACAGGGTCCCCAGGGAGACCGTGTCAGTAGGGATGTGTGCCTGGCTGTGTACGTGGGTGT 
GCAGTGCACGTGAGAGCACGTGGCGGCTTCTGGGGGCCATGTTTGGGGAGGGAGGTGTGCCAGCAGCCTGGAGAGCC 
TCAGTCCCTGTAGCCCCCTGCCCTGGCACAGCTGCATGCACTTCAAGGGCAGCCTTTGGGGGTTGGGGTTTCTGCCA 
CTTCCGGGTCTAGGCCCTGCCCCAAATCCAGCCAGTCCTGCCCCAGCCCACCCCCACATTGGAGCCCTCCTGCTGCT 
TTGGTGCCTCAAATAAATACAGATGTCCCCCAAAAAAAAAAAAAAAAA 
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FIGURE 9 

GAAGTTTCTCACTAGGGTCTTCTCTGGCCCAGCCTTTGACTGAAGCTGGTCTGGAGACAGGGGCATTAGAGAAGTGA 
CTCATAGATGGCCTAAAGAAGCGGGGCCACTCAAGGACCCAGGACAGAGGGAAGAGGGCCAACCCAGCTGGACCACA 
GGCAAACCCCATTGCCTTTGAGAGAAAGAAGAGGACCCGGTGAAACATGCTGCTGCTGAAGAAACACACGGAGGACA 
TCAGCAGCGTCTACGAGATCCGCGAGAGGCTCGGCTCGGGTGCCTTCTCCGAGGTGGTGCTGGCCCAGGAGCGGGGC 
TCCGCACACCTCGTGGCCCTCAAGTGCATCCCCAAGAAGGCCCTCCGGGGCAAGGAGGCCCTGGTGGAGAACGAGAT 
CGCAGTGCTCCGTAGGATCAGTCACCCCAACATCGTCGCTCTGGAGGATGTCCACGAGAGCCCTTCCCACCTCTACC 
TGGCCATGGAACTGGTGACGGGTGGCGAGCTGTTTGACCGCATCATGGAGCGCGGCTCCTACACAGAGAAGGATGCC 
AGCCATCTGGTGGGTCAGGTCCTTGGCGCCGTCTCCTACCTGCACAGCCTGGGGATCGTGCACCGGGACCTCAAGCC 
CGAAAACCTCCTGTATGCCACGCCCTTTGAGGACTCGAAGATCATGGTCTCTGACTTTGGACTCTCCAAAATCCAGG 
CTGGGAACATGCTAGGCACCGCCTGTGGGACCCCTGGATATGTGGCCCCAGAGCTCTTGGAGCAGAAACCCTACGGG 
AAGGCCGTAGATGTGTGGGCCCTGGGCGTCATCTCCTACATCCTGCTGTGTGGGTACCCCCCCTTCTACGACGAGAG 
CGACCCTGAGCTCTTCAGCCAGATCCTGAGGGCCAGCTATGAGTTTGACTXTCCTTTCTGGGATGACATCTCAGAAT 
CAGGCAAAGACTTTATTCGGCACCTTCTGGAGCGAGACCTTCAGAAGAGGTTCACCTGCCAACAGGCCTTGCGGGAC 
CTTTGGATCTTTTGGGACACAGGCTTTGGCAGGGACATCTTAGGGTTTGTCAGTGAGCAGATCCGGAAGAACTTTGC 
TTGGACACACTGGAAGCGAGCCTTCAATGCCACCTTGTTCCTGCGCCACATCCGGAAGCTGGGGCAGATCCCAGAGG 
GCGAGGGGGCCTCTGAGCAGGGCATGGSCCGXCACAGCCACTXAGGCCTTCGTGCTGGCCAGCCCCCCAAGTGG TGA 
TGCCCAGGXAGATGCCGAGGCCAAGTGGAXTGAXCCCCAGATTTXCTTXC 
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GGAACGAGTGGGAACGTAGCTGGTCGCAGAGGGCACCAGCGGCTGCAGGACTTCACCAAGGGACCCTGAGGCTCGTG 
AGCAGGGACCCGCGGTGCGGGTTATGCTGGGGGCTCAGATCACCGTAGACAACTGGACACTCAGGACCACGCCATGG 
AGGAGCTGCAGGATGATTATGAAGACATGATGGAGGAGAATCTGGAGCAGGAGGAATATGAAGACCCAGACATCCCC 
GAGTCCCAGATGGAGGAGCCGGCAGCTCACGACACCGAGGCAACAGCCACAGACTACCACACCACATCACACCCGGG 
TACCCACGAGGTCTATGTGGAGCTGCAGGAGCTGGTGATGGACGAAAAGAACCAGGAGCTGAGATGGATGGAGGCGG 
CGCGCTGGGTGCAACTGGAGGAGAACCTGGGGGAGAATGGGGCCTGGGGCCGCCCGCACCTCTCTCACCTCACCTTC 
TGGAGCCTCCTAGAGCTGCGTAGAGTCTTCACCAAGGGTACTGTCCTCCTAGACCTGCAAGAGACCTCCCTGGCTGG 
AGTGGCCAACCAACTGCTAGACAGGTTTATCTTTGAAGACCAGATCCGGCCTCAGGACCGAGAGGAGCTGCTCCGGG 
CCCTGCTGCTTAAACACAGCCACGCTGGAGAGCTGGAGGCCCTGGGGGGTGTGAAGCCTGCAGTCCTGACACGCTCT 
GGGGATCCTTCACAGCCTCTGCTCCCCCAACACTCCTCACTGGAGACACAGCTCTTCTGTGAGCAGGGAGATGGGGG 
CACAGAAGGGCACTCACCATCTGGAATTCTGGAAAAGATTCCCCCGGATTCAGAGGCCACGTTGGTGCTAGTGGGCC 
GCGCCGACTTCCTGGAGCAGCCGGTGCTGGGCTTCGTGAGGCTGCAGGAGGCAGCGGAGCTGGAGGCGGTGGAGCTG 
CCGGTGCCTATACGCTTCCTCTTTGTGTTGCTGGGACCTGAGGCCCCCCACATCGATTACACCCAGCTTGGCCGGGC 
TGCTGCCACCCTCATGTCAGAGAGGGTGTTCCGCATAGATGCCTACATGGCTCAGAGCCGAGGGGAGCTGCTGCACT 
CCCTAGAGGGCTTCCTGGACTGCAGCCTAGTGCTGCCTCCCACCGATGCCCCCTCCGAGCAGGCACTGCTCAGTCTG 
GTGCCTGTGCAGAGGGAGCTACTTCGAAGGCGCTATCAGTCCAGCCCTGCCAAGCCAGACTCCAGCTTCTACAAGGG 
CCTAGACTTAAATGGGGGCCCAGATGACCCTCTGCAGCAGACAGGCCAGCTCTTCGGGGGCCTGGTGCGTGATATCC 
GGCGCCGCTACCCCTATTACCTGAGTGACATCACAGATGCATTCAGCCCCCAGGTCCTGGCTGCCGTCATCTTCATC 
TACTTTGCTGCACTGTCACCCGCCATCACCTTCGGCGGCCTCCTGGGAGAAAAGACCCGGAACCAGATGGGAGTGTC 
GGAGCTGCTGATCTCCACTGCAGTGCAGGGCATTCTCTTCGCCCTGCTGGGGGCTCAGCCCCTGCTTGTGGTCGGCT 
TCTCAGGACCCCTGCTGGTGTTTGAGGAAGCCTTCTTCTCGTTCTGCGAGACCAACGGTCTAGAGTACATCGTGGGC 
CGCGTGTGGATCGGCTTCTGGCTCATCCTGCTGGTGGTGTTGGTGGTGGCCTTCGAGGGTAGCTTCCTGGTCCGCTT 
CATCTCCCGCTATACCCAGGAGATCTTCTCCTTCCTCATTTCCCTCATCTTCATCTATGAGACTTTCTCCAAGCTGA 
TCAAGATCTTCCAGGACCACCCACTACAGAAGACTTATAACTACAACGTGTTGATGGTGCCCAAACCTCAGGGCCCC 
CTGCCCAACACAGCCCTCCTCTCCCTTGTGCTCATGGCCGGTACCTTCTTCTTTGCCATGATGCTGCGCAAGTTCAA 
GAACAGCTCCTATTTCCCTGGCAAGCTGCGTCGGGTCATCGGGGACTTCGGGGTCCCCATCTCCATCCTGATCATGG 
TCCTGGTGGATTTCTTCATTCAGGATACCTACACCCAGAAACTCTCGGTGCCTGATGGCTTCAAGGTGTCCAACTCC 
TCAGCCCGGGGCTGGGTCATCCACCCACTGGGCTTGCGTTCCGAGTTTCCCATCTGGATGATGTTTGCCTCCGCCCT 
GCCTGCTCTGCTGGTCTTCATCCTCATATTCCTGGAGTCTCAGATCACCACGCTGATTGTCAGCAAACCTGAGCGCA 
AGATGGTCAAGGGCTCCGGCTTCCACCTGGACCTGCTGCTGGTAGTAGGCATGGGTGGGGTGGCCGCCCTCTTTGGG 
ATGCCCTGGCTCAGTGCCACCACCGTGCGTTCCGTCACCCATGCCAACGCCCTCACTGTCATGGGCAAAGCCAGCAC 
CCCAGGGGCTGCAGCCCAGATCCAGGAGGTCAAAGAGCAGCGGATCAGTGGACTCCTGGTCGCTGTGCTTGTGGGCC 
TGTCCATCCTCATGGAGCCCATCCTGTCCCGCATCCCCCTGGCTGTACTGTTTGGCATCTTCCTCTACATGGGGGTC 
ACGTCGCTCAGCGGCATCCAGCTCTTTGACCGCATCTTGCTTCTGTTCAAGCCACCCAAGTATCACCCAGATGTGCC 
CTACGTCAAGCGGGTGAAGACCTGGCGCATGCACTTATTCACGGGCATCCAGATCATCTGCCTGGCAGTGCTGTGGG 
TGGTGAAGTCCACGCCGGCCTCCCTGGCCCTGCCCTTCGTCCTCATCCTCACTGTGCCGCTGCGGCGCGTCCTGCTG 
CCGCTCATCTTCAGGAACGTGGAGCTTCAGTGTCTGGATGCTGATGATGCCAAGGCAACCTTTGATGAGGAGGAAGG 
TCGGGATGAATACGACGAAGTGGCCATGCCTGTG TGA GGGGCGGGCCCAGGCCCTAGACCCTCCCCCACCATTCCAC 
ATCCCCACCTTCCAAGGAAAAGCAGAAGTTCATGGGCACCTCATGGACTCCAGGATCCTCCTGGAGCAGCAGCTGAG 
GCCCCAGGGCTGTGGGTGGGGAAGGAAGGCGTGTCCAGGAGACCTTCCACAAAGGGTAGCCTGGCTTTTCTGGCTGG 
GGATGGCCGATGGGGCCCACATTAGGGGGTTTGTTGCACAGTCCCTCCTGTTGCCACACTTTCACTGGGGATCCCGT 
GCTGGAAGACTTAGATCTGAGCCCTCCCTCTTCCCAGCACAGGCAGGGGTAGAAGCAAAGGCAGGAGGTGGGTGAGC 
GGGTGGGGTGCTTGCTGTGTGACCTTGGGCAAGTCCCTTGACCTTTCCAGCCTATATTTCCTCTTCTGTAAAATGGG 
TATATTGATGATAATACCCACATTACAGGATGGTTACTGAGGACCAAAGATACATGTAAAATAGGGCTTTGTAAACT 
CCACAGGGACTGTTCTATAGCAGTCATCATTTGTCTTTGAACGTACCCAAGGTCACATAGCTGGGATTTGAACTGAG 
CCGTGCAGCT 
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CTCTTTTGCTNGGACTTCACTGTCACTCANGAAAAAGCNGTGAANCTAAAA.CAGAAGAATCTTAGCACTGAGATAAG 
GGAGAACCTGTCAGAGCTCCGTCAGGAGAATTCCAAGTTGACGTTCAATCAGCTGCTGACCCGCTTCTCTGCCTACA 
TGGTAGCCTGGGTTGTCTCTACAGGAGTGGCCATAGCCTGCTGTGCAGCCGTTTATTACCTGGCTGAGTACAACTTA 
GAGTTCCTGAAGACACACAGTAACCCTGGGGCGGTGCTGTTACTGCCTTTCGTTGTGTCCTGCATTAATCTGGCCGT 
GCCATGCATCTACTCCATGTTCAGGCTTGTGGAGAGGTACGAGATGCCACGGCACGAAGTCTACGTTCTCCTGATCC 
GAAACATCTTTTTGAAAATATCAATCATTGGCATTCTTTGTTACTATTGGCTCAACACCGTGGCCCTGTCTGGTGAA 
GAGTGTTGGGAAACCCTCATTGGCCAGGACATCTACCGGCTCCTTCTGATGGATTTTGTGTTCTCTTTAGTCAATTC 
CTTCCTGGGGGAGTTTCTGAGGAGAATCATTGGGATGCAACTGATCACAAGTCTTGGCCTTCAGGAGTTTGACATTG 
CCAGGAACGTTCTAGAACTGATCTATGCACAAACTCTGGTGTGGATTGGCATCTTCTTCTGCCCCCTGCTGCCCTTT 
ATCCAAATGATTATGCTTTTCATCATGTTCTACTCCAAAAATATCAGCCTGATGATGAATTTCCAGCCTCCGAGCAA 
AGCCTGGCGGGCCTCACAGATGATGACTTTCTTCATCTTCTTGCTCTTTTTCCCATCCTTCACCGGGGTCTTGTGCA 
CCCTGGCCATCACCATCTGGAGATTGAAGCCTTCAGCTGACTGTGGCCCTTTTCGAGGTCTGCCTCTCTTCATTCAC 
TCCATCTACAGCTGGATCGACACCCTAAGTACACGGCCTGGCTACCTGTGGGTTGTTTGGATCTATCGGAACCTCAT 
TGGAAGTGTGCACTTCTTTTTCATCCTCACCCTCATTGTGCTAATCATCACCTATCTTTACTGGCAGATCACAGAGG 
GAAGGAAGATTATGATAAGGCTGCTCCATGAGCAGATCATTAATGAGGGCAAAGATAAAATGTTCCTGATAGAAAAA 
TTGATCAAGCTGCAGGATATGGAGAAGAAAGCAAACCCCAGCTCACTTGTTCTGGAAAGGAGAGAGGTGGAGCAACA 
AGGCTTTTTGCATTTGGGGGAACATGATGGCAGTCTTGACTTGCGATCTAGAAGATCAGTTCAAGAAGGTAATCCAA 
GGGCCTGATGACTCTTTTGGTAACCAGACACCAATCAAATAAGGGGAGGAGATGAAAATGGAATGATTTCTTCCATG 
CCACCTGTGCCTTTAGGAACTGCCCAGAAGAAAATCCAAGGCTTTAGCCAGGAGCGGAAACTGACTACCATGTAATT 
ATCAAAGTAAAATTGGGCATTCCATGCTATTTTTAATACCTGGATTGCTGATTTTTCAAGACAAAATACTTGGGGTT 
TTCCAATAAAGATTGTTGTAATATTGAAATGAGCCTACAAAAACCTAGGAAGAGATAACTAGGGAATAATGTATATT 
ATCTTCAAGAAATGTGTGCAGGAATGATTGGTTCTTAGAAATCTCTCCTGCCAGACTTCCCAGACCTGGCAAAGGTT 
TAGAAACTGTTGCTAAGAAAAGTGGTCCATCCTGAATAAACATGTAATACTCCAGCAGGGATATGAAGCCTCTGAAT 
TGTAGAACCTGCATTTATTTGTGACTTTGAACTAAAGACATCCCCCATGTCCCAAAGGTGGAATACAACCAGAGGTC 
TCATCTCTGAACTTTCTTGCGTACTGATTACATGAGTCTTTGGAGTCGGGGATGGAGGAGGTTCTGCCCCTGTGAGG 
TGTTATACATGACCATCAAAGTCCTACGTCAAGCTAGCTTTGCACAGTGGCAGTACCGTAGCCAATGAGATTTATCC 
GAGACGCGATTATTGCTAATTGGAAATTTTCCCAATACCCCACCGTGATGACTTGAAATATAATCAGCGCTGGCAAT 
TTTTGACAGTCTCTACGGAGACTGAATAAGAAAAAAGAAAAGAAAAGAAATTAGCTGGGTGCGATGGCTTATGCCTG 
TAATCCCGGCACTTTGGGAGGCTGAGGCAAGCGGATCACTTAATGTCAGGAGTTCAAGACCAGCCTGGCCAACATGG 
TGAAACCCCGTCTCTACTAAGGATAAAAAAACTGGCTGGGCGTGGTGGTACATGCCT 
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GCTTCAGGGTACAGCTCCCCCGCAGCCAGAAGCCGGGCCTGCAGCCCCTCAGCACCGCTCCGGGACACCCCACCCGC 
TTCCCAGGCGTGACCTGTCAACAGCAACTTCGCGGTGTGGTGAACTCTCTGAGGAAAAACCATTTTGATTATTACTC 
TCAGACGTGCGTGGCAACAAGTGACTGAGACCTAGAAATCCAAGCGTTGGAGGTCCTGAGGCCAGCCTAAGTCGCTT 
CAAAATGGAACGAAGGCGTTTGTGGGGTTCCATTCAGAGCCGATACATCAGCATGAGTGTGTGGACAAGCCCACGGA 
GACTTGTGGAGCTGGCAGGGCAGAGCCTGCTGAAGGATGAGGCCCTGGCCATTGCCGCCCTGGAGTTGCTGCCCAGG 
GAGCTCTTCCCGCCACTCTTCATGGCAGCCTTTGACGGGAGACACAGCCAGACCCTGAAGGCAATGGTGCAGGCCTG 
GCCCTTCACCTGCCTCCCTCTGGGAGTGCTGATGAAGGGACAACATCTTCACCTGGAGACCTTCAAAGCTGTGCTTG 
ATGGACTTGATGTGCTCCTTGCCCAGGAGGTTCGCCCCAGGAGGTGGAAACTTCAAGTGCTGGATTTACGGAAGAAC 
TCTCATCAGGACTTCTGGACTGTATGGTCTGGAAACAGGGCCAGTCTGTACTCATTTCCAGAGCCAGAAGCAGCTCA 
GCCCATGACAAAGAAGCGAAAAGTAGATGGTTTGAGCACAGAGGCAGAGCAGCCCTTCATTCCAGTAGAGGTGCTCG 
TAGACCTGTTCCTCAAGGAAGGTGCCTGTGATGAATTGTTCTCCTACCTCATTGAGAAAGTGAAGCGAAAGAAAAAT 
GTACTACGCCTGTGCTGTAAGAAGCTGAAGATTTTTGCAATGCCCATGCAGGATATCAAGATGATCCTGAAAATGGT 
GCAGCTGGACTCTATTGAAGATTTGGAAGTGACTTGTACCTGGAAGCTACCCACCTTGGCGAAATTTTCTCCTTACC 
TGGGCCAGATGATTAATCTGCGTAGACTCCTCCTCTCCCACATCCATGCATCTTCCTACATTTCCCCGGAGAAGGAA 
GAGCAGTATATCGCCCAGTTCACCTCTCAGTTCCTCAGTCTGCAGTGCCTGCAGGCTCTCTATGTGGACTCTTTATT 
TTTCCTTAGAGGCCGCCTGGATCAGTTGCTCAGGCACGTGATGAACCCCTTGGAAACCCTCTCAATAACTAACTGCC 
GGCTTTCGGAAGGGGATGTGATGCATCTGTCCCAGAGTCCCAGCGTCAGTCAGCTAAGTGTCCTGAGTCTAAGTGGG 
GTCATGCTGACCGATGTAAGTCCCGAGCCCCTCCAAGCTCTGCTGGAGAGAGCCTCTGCCACCCTCCAGGACCTGGT 
CTTTGATGAGTGTGGGATCACGGATGATCAGCTCCTTGCCCTCCTGCCTTCCCTGAGCCACTGCTCCCAGCTTACAA 
CCTTAAGCTTCTACGGGAATTCCATCTCCATATCTGCCTTGCAGAGTCTCCTGCAGCACCTCATCGGGCTGAGCAAT 
CTGACCCACGTGCTGTATCCTGTCCCCCTGGAGAGTTATGAGGACATCCATGGTACCCTCCACCTGGAGAGGCTTGC 
CTATCTGCATGCCAGGCTCAGGGAGTTGCTGTGTGAGTTGGGGCGGCCCAGCATGGTCTGGCTTAGTGCCAACCCCT 
GTCCTCACTGTGGGGACAGAACCTTCTATGACCCGGAGCCCATCCTGTGCCCCTGTTTCATGCCTAAC TAG CTGGGT 
GCACATATCAAATGCTTCATTCTGCATACTTGGACACTAAAGCCAGGATGTGCATGCATCTTGAAGCAACAAAGCAG 
CCACAGTTTCAGACAAATGTTCAGTGTGAGTGAGGAAAACATGTTCAGTGAGGAAAAAACATTCAGACAAATGTTCA 
GTGAGGAAAAAAAGGGGAAGTTGGGGATAGGCAGATGTTGACTTGAGGAGTTAATGTGATCTTTGGGGAGATACATC 
TTATAGAGTTAGAAATAGAATCTGAATTTCTAAAGGGAGATTCTGGCTTGGGAAGTACATGTAGGAGTTAATCCCTG 
T G TAG AC T G T T G T AAAG AAAC T G T T G AAAAT AAAG AG AAG C AAT G T G AAG C AAAAAAAAAAAAAAAAAA 
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GGGCAGGCTCAGTGTGAGTGAACTGGAGGCTTCTCTACAACAJ^ACCCAAAGGAGCATTGCAGGTCCTATTTGCAAC 
CTGAAGTTTGTGACTCTCCTGGTTGCCTTAAGTTCAGAACTCCCATTCCTGGGAGCTGGAGTACAGCTTCAAGACAA 
TGGGTATAATGGATTGCTCATTGCCATTAATCCTCAGGTACCTGAGAATCAGAACCTCATCTCAAACATTAAGGAAA 
TGATAACTGAAGCTTCATTTTACCTATTTAATGCTACCAAGAGAAGAGTATTTTTCAGAAATATAAAGATTTTAATA 
CCTGCCACATGGAAAGCTAATAATAACAGCAAAATAAAACAAGAATCATATGAAAAGGCAAATGTCATAGTGACTGA 
CTGGTATGGGGCACATGGAGATGATCCATACACCCTACAATACAGAGGGTGTGGAAAAGAGGGAAAATACATTCATT 
TCACACCTAATTTCCTACTGAATGATAACTTAACAGCTGGCTACGGATCACGAGGCCGAGTGTTTGTCCATGAATGG 
GCCCACCTCCGTTGGGGTGTGTTCGATGAGTATATCAATGACAAACCTTTCTACATAAATGGGCAAAATCAAATTAA 
AGTGACAAGGTGTTCATCTGACATCACAGGCATTTTTGTGTGTGAAAAAGGTCCTTGCCCCCAAGAAAACTGTATTA 
TTAGTAAGCTTTTTAAAGAAGGATGCACCTTTATCTACAATAGCACCCAAAATGCAACTGCATCAATAATGTTCATG 
CAAAGTTTATCTTCTGTGGTTGAATTTTGTAATGCAAGTACCCACAACCAAGAAGCACCAAACCTACAGAACCAGAT 
GTGCAGCCTCAGAAGTGCATGGGATGTAATCACAGACTCTGCTGACTTTCACCACAGCTTTCCCATGAATGGGACTG 
AGCTTCCACCTCCTCCCACATTCTCGCTTGTACAGGCTGGCGACAAAGTGGTCTGTTTAGTGCTGGATGTGTCCAGC 
AAGATGGCAGAGGCTGACAGACTCCTTCAACTACAACAAGCCGCAGAATTTTATTTGATGCAGATTGTTGAAATTCA 
TACCTTCGTGGGCATTGCCAGTTTCGACAGCAAAGGAGAGATCAGAGCCCAGCTACACCAAATTAACAGCAATGATG 
ATCGAAAGTTGCTGGTTTCATATCTGCCCACCACTGTATCAGCTAAAACAGACATCAGCATTTGTTCAGGGCTTAAG 
AAAGGATTTGAGGTGGTTGAAAAACTGAATGGAAAAGCTTATGGCTCTGTGATGATATTAGTGACCAGCGGAGATGA 
TAAGCTTCTTGGCAATTGCTTACCCACTGTGCTCAGCAGTGGTTCAACAATTCACTCCATTGCCCTGGGTTCATCTG 
CAGCCCCAAATCTGGAGGAATTATCACGTCTTACAGGAGGTTTAAAGTTCTTTGTTCCAGATATATCAAACTCCAAT 
AGCATGATTGATGCTTTCAGTAGAATTTCCTCTGGAACTGGAGACATTTTCCAGCAACATATTCAGCTTGAAAGTAC 
AGGTGAAAATGTCAAACCTCACCATCAATTGAAAAACACAGTGACTGTGGATAATACTGTGGGCAACGACACTATGT 
TTCTAGTTACGTGGCAGGCCAGTGGTCCTCCTGAGATTATATTATTTGATCCTGATGGACGAAAATACTACACAAAT 
AATTTTATCACCAATCTAACTTTTCGGACAGCTAGTCTTTGGATTCCAGGAACAGCTAAGCCTGGGCACTGGACTTA 
CACCCTGAACAATACCCATCATTCTCTGCAAGCCCTGAAAGTGACAGTGACCTCTCGCGCCTCCAACTCAGCTGTGC 
CCCCAGCCACTGTGGAAGCCTTTGTGGAAAGAGACAGCCTCCATTTTCCTCATCCTGTGATGATTTATGCCAATGTG 
AAACAGGGATTTTATCCCATTCTTAATGCCACTGTCACTGCCACAGTTGAGCCAGAGACTGGAGATCCTGTTACGCT 
GAGACTCCTTGATGATGGAGCAGGTGCTGATGTTATAAAAAATGATGGAATTTACTCGAGGTATTTTTTCTCCTTTG 
CTGCAAATGGTAGATATAGCTTGAAAGTGCATGTCAATCACTCTCCCAGCATAAGCACCCCAGCCCACTCTATTCCA 
GGGAGTCATGCTATGTATGTACCAGGTTACACAGCAAACGGTAATATTCAGATGAATGCTCCAAGGAAATCAGTAGG 
CAGAAATGAGGAGGAGCGAAAGTGGGGCTTTAGCCGAGTCAGCTCAGGAGGCTCCTTTTCAGTGCTGGGAGTTCCAG 
CTGGCCCCCACCCTGATGTGTTTCCACCATGCAAAATTATTGACCTGGAAGCTGTAAAAGTAGAAGAGGAATTGACC 
CTATCTTGGACAGCACCTGGAGAAGACTTTGATCAGGGCCAGGCTACAAGCTATGAAATAAGAATGAGTAAAAGTCT 
ACAGAATATCCAAGATGACTTTAACAATGCTATTTTAGTAAATACATCAAAGCGAAATCCTCAGCAAGCTGGCATCA 
GGGAGATATTTACGTTCTCACCCCAAATTTCCACGAATGGACCTGAACATCAGCCAAATGGAGAAACACATGAAAGC 
CACAGAATTTATGTTGCAATACGAGCAATGGATAGGAACTCCTTACAGTCTGCTGTATCTAACATTGCCCAGGCGCC 
TCTGTTTATTCCCCCCAATTCTGATCCTGTACCTGCCAGAGATTATCTTATATTGAAAGGAGTTTTAACAGCAATGG 
GTTTGATAGGAATCATTTGCCTTATTATAGTTGTGACACATCATACTTTAAGCAGGAAAAAGAGAGCAGACAAGAAA 
GAGAATGGAACAAAATTATTATAAATAAATATCCAAAGTGTCTTCCTTCTTAGATATAAGACCCATGGCCTTCGACT 
ACAAAAACATACTAACAAAGTCAAATTAACATCAAAACTGTATTAAAATGCATTGAGTTTTTGTACAATACAGATAA 
GATTTTTACATGGTAGATCAACAATTCTTTTTGGGGGTAGATTAGAAAACCCTTACACTTTGGCTATGAACAAATAA 
TAAAAATTATTCTTTAAAGTAATGTCTTTAAAGGCAAAGGGAAGGGTAAAGTCGGACCAGTGTCAAGGAAAGTTTGT 
TTTATTGAGGTGGAAAAATAGCCCCAAGCAGAGAAAAGGAGGGTAGGTCTGCATTATAACTGTCTGTGTGAAGCAAT 
CATTTAGTTACTTTGATTAATTTTTCTTTTCTCCTTATCTGTGCAGTACAGGTTGCTTGTTTACATGAAGATCATGC 
TATATTTTATATATGTAGCCCCTAATGCAAAGCTCTTTACCTCTTGCTATTTTGTTATATATATTTCAGATGACATC 
TCCCTGCTAATGCTCAGAGATCTTTTTTCACTGTAAGAGGTAACCTTTAACAATATGGGTATTACCTTTGTCTCTTC 
ATACCGGTTTTATGACAAAGGTCTATTGAATTTATTTGTXTGTAAGTTTCTACTCCCATCAAAGCAGCTTTCTAAGT 
TTATTGCCTTGGGTTATTATGGAATGATAGTTATAGCCCCXTATAATGCCTTACCTAGGAAA 
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GTCATATTGAACATTCCAGATACCTATCATTACTCGATGCTGTTGATAACAGCAAGM^GCTTTGAACTCAGGGTCA 
CCACCAGCTATTGGACCTTACTATGAAAACCATGGATACCAACCGGAAAACCCCTATCCCGCACAGCCCACTGTGGT 
CCCCACTGTCTACGAGGTGCATCCGGCTCAGTACTACCCGTCCCCCGTGCCCCAGTACGCCCCGAGGGTCCTGACGC 
AGGCTTCCAACCCCGTCGTCTGCACGCAGCCCAAATCCCCATCCGGGACAGTGTGCACCTCAAAGACTAAGAAAGCA 
CTGTGCATCACCTTGACCCTGGGGACCTTCCTCGTGGGAGCTGCGCTGGCCGCTGGCCTACTCTGGAAGTTCATGGG 
CAGCAAGTGCTCCAACTCTGGGATAGAGTGCGACTCCTCAGGTACCTGCATCAACCCCTCTAACTGGTGTGATGGCG 
TGTCACACTGCCCCGGCGGGGAGGACGAGAATCGGTGTGTTCGCCTCTACGGACCAAACTTCATCCTTCAGATGTAC 
TCATCTCAGAGGAAGTCCTGGCACCCTGTGTGCCAAGACGACTGGAACGAGAACTACGGGCGGGCGGCCTGCAGGGA 
CATGGGCTATAAGAATAATTTTTACTCTAGCCAAGGAATAGTGGATGACAGCGGATCCACCAGCTTTATGAAACTGA 
ACACAAGTGCCGGCAATGTCGATATCTATAAAAAACTGTACCACAGTGATGCCTGTTCTTCAAAAGCAGTGGTTTCT 
TTACGCTGTTTAGCCTGCGGGGTCAACTTGAACTCAAGCCGCCAGAGCAGGATCGTGGGCGGTGAGAGCGCGCTCCC 
GGGGGCCTGGCCCTGGCAGGTCAGCCTGCACGTCCAGAACGTCCACGTGTGCGGAGGCTCCATCATCACCCCCGAGT 
GGATCGTGACAGCCGCCCACTGCGTGGAAAAACCTCTTAACAATCCATGGCATTGGACGGCATTTGCGGGGATTTTG 
AGACAATCTTTCATGTTCTATGGAGCCGGATACCAAGTACAAAAAGTGATTTCTCATCCAAATTATGACTCCAAGAC 
CAAGAACAATGACATTGCGCTGATGAAGCTGCAGAAGCCTCTGACTTTCAACGACCTAGTGAAACCAGTGTGTCTGC 
CCAACCCAGGCATGATGCTGCAGCCAGAACAGCTCTGCTGGATTTCCGGGTGGGGGGCCACCGAGGAGAAAGGGAAG 
ACCTCAGAAGTGCTGAACGCTGCCAAGGTGCTTCTCATTGAGACACAGAGATGCAACAGCAGATATGTCTATGACAA 
CCTGATCACACCAGCCATGATCTGTGCCGGCTTCCTGCAGGGGAACGTCGATTCTTGCCAGGGTGACAGTGGAGGGC 
CTCTGGTCACTTCGAACAACAATATCTGGTGGCTGATAGGGGATACAAGCTGGGGTTCTGGCTGTGCCAAAGCTTAC 
AGACCAGGAGTGTACGGGAATGTGATGGTATTCACGGACTGGATTTATCGACAAATGAAGGCAAACGGCTAATCCAC 
ATGGTCTTCGTCCTTGACGTCGTTTTACAAGAAAACAATGGGGCTGGTTTTGCTTCCCCGTGCATGATTTACTCTTA 
GAGATGATTCAGAGGTCACTTCATTTTTATTAAACAGTGAACTTGTCTGGCTTTGGCACTCTCTGCCATACTGTGCA 
GGCTGCAGTGGCTCCCCTGCCCAGCCTGCTCTCCCTAACCCCTTGTCCGCAAGGGGTGATGGCCGGCTGGTTGTGGG 
CACTGGCGGTCAATTGTGGAAGGAAGAGGGTTGGAGGCTGCCCCCATTGAGATCTTCCTGCTGAGTCCTTTCCAGGG 
GCCAATTTTGGATGAGCATGGAGCTGTCACTTCTCAGCTGCTGGATGACTTGAGATGAAAAAGGAGAGACATGGAAA 
GGGAGACAGCCAGGTGGCACCTGCAGCGGCTGCCCTCTGGGGCCACTTGGTAGTGTCCCCAGCCTACTTCACAAGGG 
GATTTTGCTGATGGGTTCTTAGAGCCTTAGCAGCCCTGGATGGTGGCCAGAAATAAAGGGACCAGCCCTTCATGGGT 
GGTGACGTGGTAGTCACTTGTAAGGGGAACAGAAACATTTTTGTTCTTATGGGGTGAGAATATAGACAGTGCCCTTG 
GTGCGAGGGAAGCAATTGAAAAGGAACTTGCCCTGAGCACTCCTGGTGCAGGTCTCCACCTGCACATTGGGTGGGGC 
TCCTGGGAGGGAGACTCAGCCTTCCTCCTCATCCTCCCTGACCCTGCTCCTAGCACCCTGGAGAGTGAATGCCCCTT 
GGTCCCTGGCAGGGCGCCAAGTTTGGCACCATGTCGGCCTCTTCAGGCCTGATAGTCATTGGAAATTGAGGTCCATG 
GGGGAAATCAAGGATGCTCAGTTTAAGGTACACTGTTTCCATGTTATGTTTCTACACATTGATGGTGGTGACCCTGA 
GTTCAAAGCCATCTT 
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GGATTTCCGGGCTCCATGGCAAGATCCCTTCTCCTGCCCCTGCAGATCCTACTGCTATCCTTAGCCTTGGAAACTGC 
AGGAGAAGAAGCCCAGGGTGACAAGATTATTGATGGCGCCCCATGTGCAAGAGGCTCCCACCCATGGCAGGTGGCCC 
TGCTCAGTGGCAATCAGCTCCACTGCGGAGGCGTCCTGGTCAATGAGCGCTGGGTGCTCACTGCCGCCCACTGCAAG 
ATGAATGAGTACACCGTGCACCTGGGCAGTGATACGCTGGGCGACAGGAGAGCTCAGAGGATCAAGGCCTCGAAGTC 
ATTCCGCCACCCCGGCTACTCCACACAGACCCATGTTAATGACCTCATGCTCGTGAAGCTCAATAGCCAGGCCAGGC 
TGTCATCCATGGTGAAGAAAGTCAGGCTGCCCTCCCGCTGCGAACCCCCTGGAACCACCTGTACTGTCTCCGGCTGG 
GGCACTACCACGAGCCCAGATGTGACCTTTCCCTCTGACCTCATGTGCGTGGATGTCAAGCTCATCTCCCCCCAGGA 
CTGCACGAAGGTTTACAAGGACTTACTGGAAAATTCCATGCTGTGCGCTGGCATCCCCGACTCCAAGAAAAACGCCT 
GCAATGGTGACTCAGGGGGACCGTTGGTGTGCAGAGGTACCCTGCAAGGTCTGGTGTCCTGGGGAACTTTCCCTTGC 
GGCCAACCCAATGACCCAGGAGTCTACACTCAAGTGTGCAAGTTCACCAAGTGGATAAATGACACCATGAAAAAGCA 
TCGCTAACGCCACACTGAGTTAATTAACTGTGTGCTTCCAACAGAAAATGCACAGGAGTGAGGACGCCGATGACCTA 
TGAAGTCAAATTTGACTTTACCTTTCCTCAAAGATATATTTAAACCTCATGCCCTGTTGATAAACCAATCAAATTGG 
T AAAG AC C T AAAAC C AAAAC AAAT AAAG AAAC AC AAAAC C C T C AA 
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CCGAGACTCACGGTCAAGCTAAGGCGAAGAGTGGGTGGCTGAAGCCATACTATTTTATAGAATTAAJGGAAAGCAGA 
AAAGACATCACAAACCAAGAAGAACTTTGGAAAATGAAGCCTAGGAGAAATTTAGAAGAAGACGATTATTTGCATAA 
GGACACGGGAGAGACCAGCATGCTAAAAAGACCTGTGCTTTTGCATTTGCACCAAACAGCCCATGCTGATGAATTTG 
ACTGCCCTTCAGAACTTCAGCACACACAGGAACTCTTTCCACAGTGGCACTTGCCAATTAAAATAGCTGCTATTATA 
GCATCTCTGACTTTTCTTTACACTCTTCTGAGGGAAGTAATTCACCCTTTAGCAACTTCCCATCAACAATATTTTTA 
TAAAATTCCAATCCTGGTCATCAACAAAGTCTTGCCAATGGTTTCCATCACTCTCTTGGCATTGGTTTACCTGCCAG 
GTGTGATAGCAGCAATTGTCCAACTTCATAATGGAACCAAGTATAAGAAGTTTCCACATTGGTTGGATAAGTGGATG 
TTAACAAGAAAGCAGTTTGGGCTTCTCAGTTTCTTTTTTGCTGTACTGCATGCAATTTATAGTCTGTCTTACCCAAT 
GAGGCGATCCTACAGATACAAGTTGCTAAACTGGGCATATCAACAGGTCCAACAAAATAAAGAAGATGCCTGGATTG 
AGCATGATGTTTGGAGAATGGAGATTTATGTGTCTCTGGGAATTGTGGGATTGGCAATACTGGCTCTGTTGGCTGTG 
ACATCTATTCCATCTGTGAGTGACTCTTTGACATGGAGAGAATTTCACTATATTCAGAGCAAGCTAGGAATTGTTTC 
CCTTCTACTGGGCACAATACACGCATTGATTTTTGCCTGGAATAAGTGGATAGATATAAAACAATTTGTATGGTATA 
CACCTCCAACTTTTATGATAGCTGTTTTCCTTCCAATTGTTGTCCTGATATTTAAAAGCATACTATTCCTGCCATGC 
TTGAGGAAGAAGATACTGAAGATTAGACATGGTTGGGAAGACGTCACCAAAATTAACAAAACTGAGATATGTTCCCA 
GTTGTAGAATTACTGTTTACACACATTTTTGTTCAATATTGATATATTTTATCACCAACATTTCAAGTTTGTATTTG 
TTAATAAAATGATTATTCAAGGAAAAAAAAAAAAAAAAAA 
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GGTGGAGACACCGCCTCAGGGCTCGGTGCACAGTGGACATTTGGGGAGCGTTGTGGGTGACCCCCACACAGGCACTG 
GGAATGCAGGGGAGAGGGGGCCAAGGGGGAAAGGGGCCAGAGTGTTGGCTTTGGATTCAGGAGGGATGGATTCCAGT 
CCTAGCTTGCCACTTATTAGGACTCCTGAGAGCAGCCTCCATGAGGCCCTGGACCAGTGCATGACCGCCCTGGACCT 
CTTCCTCACCAACCAGTTCTCAGAAGCACTCAGCTACCTCAAGCCCAGAACCAAGGAAAGCATGTACCACTCACTGA 
CATATGCCACCATCCTGGAGATGCAGGCCATGATGACCTTTGACCCTCAGGACATCCTGCTTGCCGGCAACATGATG 
AAGGAGGCACAGATGCTGTGTCAGAGGCACCGGAGGAAGTCTTCTGTAACAGATTCCTTCAGCAGCCTGGTGAACCG 
CCCCACGCTGGGCCAATTCACTGAAGAGGAAATCCACGCTGAGGTCTGCTATGCAGAGTGCCTGCTGCAGCGAGCAG 
CCCTGACCTTCCTGCAGGGTTCCTCACACGGAGGGGCAGTCAGGCCCAGAGCCTTGCATGATCCCTCTCACGCCTGC 
AGCTGCCCACCTGGGCCAGGCCGTCAGCATCTTTTCCTCCTGCAGGACGAGAACATGGTGAGCTTCATCAAAGGCGG 
CATCAAAGTTCGAAACAGCTACCAGACCTACAAGGAGCTGGACAGCCTTGTTCAGTCCTCACAATACTGCAAGGGTG 
AGAACCACCCGCACTTTGAAGGAGGAGTGAAGCTTGGTGTAGGGGCCTTCAACCTGACACTGTCCATGCTTCCTACT 
AGGATCCTGAGGCTGTTGGAGTTTGTGGGGTTTTCAGGAAACAAGGACTATGGGCTGCTGCAGCTGGAGGAGGGAGC 
GTCAGGGCACAGCTTCCGCTCTGTGCTCTGTGTCATGCTCCTGCTGTGCTACCACACCTTCCTCACCTTCGTGCTCG 
GTACTGGGAACGTCAACATCGAGGAGGCCGAGAAGCTCTTGAAGCCCTACCTGAACCGGTACCCTAAGGGTGCCATC 
TTCCTGTTCTTTGCAGGGAGGATTGAAGTCATTAAAGGCAACATTGATGCAGTGAGTGATGGGGGTCCGGGCCGGGG 
CTGGGGATCCCTCGGGGTCTCCCAGACCAGCAGGAAGTCAGGCACATGTGACATACTCAGGGACAGGATAGACTGGG 
GGCGGGGGGGGGGCCAAGAGAGAACCAACCAGAGAGCAGGGGCAGGAGAGGCCCTTCTGGCAGAGCAGCCTGGGAAG 
ACAAGGGAGGAGGAGGCATTTGTGGTGCCTGGGATTTTGACTGGGAGATATAGGACTGCAGCATTGCAGTGGAGGGA 
GGTGGAGGGAGGTGCTTGAGGGAGGCAGAGGTTAGGAAAGCCCATCTGTTTAGGGCATGACGATTAGGCTGGAGTCT 
GGTACCTCCCCTCCATTATAGCTCTCTCCTGCTCTTTCATTTTGTTACTAAAAACCAGAGTCCTAGGCGGGGGCTGT 
ATTTGAGCCCAACGTCATGTAAGACTTAGGAGGTAAAACCAGGACTGGAGGCCAGATCTCCTGGCTCCTGGGGCCCC 
ACCTGAGCCTAGCACAGGGCTGGACCACTATGCCCTGGAGGAGTCCCGGTCTGCTGTGGTGTTGGGAGGTTCGGAGG 
ATGCAGAGGGGTTGGGGCTGGGTGGGCACCCGTCAGGCTGACCAGAAGGTGCCTGCAGGCCATCCGGCGTTTCGAGG 
AGTGCTGTGAGGCCCAGCAGCACTGGAAGCAGTTCCACCACATGTGCTACTGGGAGCTGATGTGGTGCTTCACCTAC 
AAGGGCCAGTGGAAGATGTCCTACTTCTACGCCGACCTGCTCAGCAAGGAGAACTGCTGGTCCAAGGTGGGCTGATG 
CCACGTGTTAGGGGCATTGGGTGACCAGGGCTGACTGTGTGCCTCCAGACCACGGGCCAAATCCCTAACTGAACACA 
GATGTCTCAGCTGGAATCTAAACATAACCTTAAATTCTAATAGGACTCAGGCTTGGAAGGAACTAAAGACCACAAGA 
GAAACTTCTGAACCACAATGTGTCACAAAGAGATTTCTAGTCACAAAGGACAGAAACATGGCTCCCTCTGTCCAGTA 
GAACTGTTTCTGTGGTAGAAATGTTCTACTTATGCACTGCCCAAATATGGTAGTCACCAGCCACATGTGCACAATGA 
GCACATGAAATGTGCCTAGTGCAACTGGGAAACTGATTGTTTTCAGTTTTATTTAATTTTAATTAACTAAATGTTAA 
ATTTAAATAGCCATGTAGGGCTTGTGGCCACTATATTGGACTATGCAGGTCCAAAACACAAAAGGCTCATATAACTG 
AACATTCTTGGCACATCCGACTTCAGGTAGGGCTGGATCCAGGAATTCAAATGATGTCGTCTGGCTTGGTCTTTCCA 
TTTGTGGCTGTGCTCTCTCCTATGACATCTTTGTTTCTGCTGCATCTTTGCAGGAAGGTTCTCTCCATGTGACAGGC 
AAGGTGGCCACGGGCTGCTTCTACTCATATCCTCCCCTTGGTTTCAACCAGAGTCCCACAGTTTGAGCCTCACTGGT 
CTGACTTGTCACCTGCCAATGCCTGGAACAGAGGGGTGGGAAGATTCCTCATGGGCTGAGAGTAGGAGAGGGGTGGT 
TCCACAGAAGAAAATGATGCACCCAGAAATAGTGGAGGGATTAACAAGATGCCATACAGGCAAAACAAAGCCAAACA 
GATGCCCGCCTACCAAAGATGAAATTTATCATGGTAAGTATTGAAATAAGTGTTAGCTTGTACCATCATAGTAATGA 
TAGTGCAGAAATTGGAACCAAGAGTCTTACAACCACCTAGCTCAGCAAACGTCTAATCTGTTATTTGTAAATACACA 
GGACATGTGTCTTCATGGCTTCATCCCCTGCCCATAGCAGACATTGCTAATCAATCCTCTGCCATGAGCCTAGCTGT 
GACCTTAGATACCTTCCCTGCAAAGCTCCAGGCAGCCGTTAGAACTGACCCACATTGGCACCTGTTAGGAGACCTGT 
CATCTGCATCTGGTCCCTCCCCCTCCTCTGCATGGAGGTCTCTTCCGTAGCCCTGCCTGGGGACCAAGCTGAAGGGG 
AATTCTCTGGGTCTGGGGCAGGTGGGGCTGGGGAAGGGAGCAGGCTTCTACTGAGCTCTCAACATGTCTGGCACCTG 
TCATGTCCTCCACACCACACCCAGTGAGGAGAAGTTATTATCTCCATTGGGCACATGAGGGCCACTGATGCTAAGCG 
AGGACTGTGACTTTCCCACCCACATGGCTGTTGGTGACAGACACCGGATTTGAGGACAGTCCTGTGTGACTCAGAGG 
CCTCTGCTTCTCTCTTGCTCCTAGAGTCCTCCTGAGGAGTCGGGGCTTGCCCTGAGCCCACCCTGCTGTTGAAGGTG 
CTTCCTCAGGCCCAGCTCCCATGGCCCCCACACCCCCTCCTCATCACCTCCTACTCCCAAAAAGGACAAAGCCTCAG 
GGAACCTTTTTTCTTTTTTTAGAGACAGGGTCTTGCTATGTTGGTCAGGCTAGTCTTGAACTCCTGGGCCCAAGCAA 
TCTTCCCACCTCTGCCTCCTAAAGTGCTGGGATTCTGGGCCTTAGGGAACCTTTTTGAACTGAAAGTGACTCTCGAA 
GCCTTCCTGTAGAGTGAGGTGGGTGGCTGGGGCTAACCATATGGGGAAGGAGAGACCTGGTGGGGGCACACAGCTGC 
TATATAGAGGAACAGAAGGTGGGCCAGGCCTCCCAGCTGTCAGGAAATCTGGCTCAGTCCCCAGACCGCTGTGTGGC 
TTTGGGGTGGCCAGTCCCTCTCTCTCTGCAACACTCTCCTCATGAGGACTTCTGTGAAAAATGGGGGTTGTAACCCT 
CTGAGTTCTGGGGCCCCACCCATTTCTAATCTCCAGAACTTGGCCACCACAGACTCCACCAACTTCTAG 
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TCCTGGGGCCTGGGCCTCTGGCCATTGCCATAGGCACCACCTGCTCTGTGCAGGCAGCGCCCCCCTCTGCCAGGATC 
CTCCGAGGTCAGCTGCTGGGTCTGACCCGCAGACCCTGGCTGAGCGACGGATGAACGGAGTATGCAGACACAGGCTT 
GCCTGTCAGCAGATGGGGGACCCCTGCCAGAGTCAGCAGCGGCCCCCATAAGCCTGCCACGCTTGCATTTATTTAGT 
ACAGATGTAATGACAAAGGCCTAAAGCAAACTCCATTTGTGGGTAATTAACATTGTCGCCCCCCCAGAAAGAGCAGT 
CCTCCGCATGATGATTAAAGGCCAGGTTCCGAGGCCTAAGTAAACCAACTTATCTAGATCAATTCCCTTACTTCTTG 
TTATCTACTCTGAGAGAATTCAGCTGCCTTCAGCCAAATCCTTTCCCGAAGCTTTTGCAAAACCTCCGAGCCTTCCA 
AGGTTTGCTTCTTTCTGTAATTTTTCTCACCACCCTGACCTATCTCCTGCAGTCAGCCCTGTGGAGGCCTTTGTGTT 
TCCCCCAGTGCTGGCAGCCTAGAGGCTGAGATGGCCAGAAACAAGGTGGTGACAGTGGCGTGCTCAGGGCTTGGGAA 
ACCCAAGGAGCTAAAGGCATGCCCAGGCAACCAAAGAGGACAGGAAGGCTTCTGAGGAGAGACCTCTGAGGTGGGTC 
TTGGAGAGGAAGGACTTAGGGAGGCAGAGTGGAGGAAGTGAGAGGACACCCCAAGCCAAGAGGGCGGCAGGACCAAA 
GGCTCAGAAGCCAGGGCGCTGCAGAGGGGCTGTGTGCCACAGGGTGAAGAGTTTGTGTGGCAGAAGGGCAGGGGGCT 
TGCATCAGGGGTGACAGCTGCTCTTTTGTCCCAGCATAGCCCCTGTACATCCCTGGAGAGCTGGGGCGTCCACAACT 
CTAAGTCACAGCCCCCATCCTAACCCTGGTGGTGCAGTGAGGGTGAGCTGTCTGTGGGCAGGAGGGAAGACTCTTGG 
AGATGAGCCTGGTGAAGGGATAATGGCATCCCGGGCCGAGGAGCAGCACAGGCAGAGGCCTGGGGAGAGTTTAAGGA 
GTGTAGGGGAGGAAATGGCAGAAGATGAGCCAGAAAAAGAAAGGTTAGGGCAGGTCCTGGAGGACATGAGTGGCTGT 
TTGGGCTTTATCCAGCAGTGGGGGAGCCTTGGCAGGCTTGTGGCTTAGATAGGTGCTTTAGAAAGCCCACCAGCAGT 
TGCTGGGCCACCCCGCTGGCTGGGTCCTGTTCTAAGGCAGGAAATACAAGCATGAGCAGGAAAAGACCCCCTCAAGG 
CTCACGTCCTAGTGGGGAGACAAGAAACACAGATGGGCAATATAACACGATGTCTGGTTCCAGTAAGTGCAGTGAAG 
AACAAGCGAGGCTGGATGCAGGGAGTGATGGGAGGGGCTTTGTAAGGGGAGGTCGGGGGAAGCCTGTCTCAGAGGAC 
ACCAGAATGGAGCGCAGGAGCAGCACGTGGCAGTCACATGGCAGGCCGTTAGGGCAGAGGGAGCTGGGCAGGGCACA 
GCAGGGCAGGAGTGTGTTTGATGTGTCCTGGGAACCGCCCTGAGGCCGTCGTGTGGCTGGAGTGCTGCAGGTGTCAA 
GGAAATTGTAGGAGATGTCTCCTGAGTGTGATGGAATATAACCAGATTTCCAGAAGGAACTGACATGATCTGACTTA 
AAAAGGTCAGTGTGCGAAATGGCTTGCAGGGGACAGGAGTGGGAGCAGGGAGATAGGAGACAATGTGTACCAGGACA 
GCAGAAAGACATCCCGGGTAGCCTGGAACAGGGAGACGGTGTGGAGATGGTGGCAGTCCGATAATGAGAGCCGTAGG 
GCAAGGCCAGCAGGATCCTAGAGTGAGACGGGAGGTAAAGTCACCGGGACTTGGTGTCTCCACGTCAGGGGCAGGGG 
AAAGGGAGAGGACAAGGGTGACCCGGGAGGTTAAAGATGGGACCGGGGCCAGACGCAGTGGCTCATGCCTGTAATCC 
TAGCACTTTGGGAGGCTGAGGCGGGCGGATAGCTTGAGGTCAGGAGTTTGAAACCAGCCTGGCCAACATGGTGAAAC 
CCCGTCTCTACTAAAATATACAAAAATTAGCCTGGCGTGGTGGTGCATGCCTGCAGTCCCAGCTATTCAGGAGGCTG 
AGGCAACAAGAATCGCTTGAACCTGGGAGGCGGAGGTTGCAGTGAGCCGAGATCGCGCCATAGCACTCCAGCCTTAG 
C C T G G GC G AC AG AG C GAG AC C AC AT C 
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GAGCAGCGCGGCCTGACGGGACCAAGGCGGCGGGAGTCTGCGGTCGTTCCCTCGGCTGTGGACCGGGCGGCACGCAC 
GCGGTGCAGGGTAACATGGCGGATGCGGAAGTAATTATTTTGCCAAAGAAACATAAGAAGAAAAAGGAGCGGAAGTC 
AT T G C C AG AAG AAG AT GT AG C C G AAAT AC AAC AC G C T G AAG AAT T T C T TAT C AAAC C T G AAT C C AAAG T T G C T AAG T 
TGGACACGTCTCAGTGGCCCCTTTTGCTAAAGAATTTTGATAAGCTGAATGTAAGGACAACACACTATACACCTCTT 
GCATGTGGTTCAAATCCTCTGAAGAGAGAGATTGGGGACTATATCAGGACAGGTTTCATTAATCTTGACAAGCCCTC 
TAACCCCTCTTCCCATGAGGTGGTAGCCTGGATTCGACGGATACTTCGGGTGGAGAAGACAGGGCACAGTGGTACTC 
TGGATCCCAAGGTGACTGGTTGTTTAATCGTGTGCATAGAACGAGCCACTCGCTTGGTGAAGTCACAACAGAGTGCA 
GGCAAAGAGTATGTGGGGATTGTCCGGCTGCACAATGCTATTGAAGGGGGGACCCAGCTTTCTAGGGCCCTAGAAAC 
TCTGACAGGTGCCTTATTCCAGCGACCCCCACTTATTGCTGCAGTAAAGAGGCAGCTCCGAGTGAGGACCATCTACG 
AGAGCAAAATGATTGAATACGATCCTGAAAGAAGATTAGGAATCTTTTGGGTGAGTTGTGAGGCTGGCACCTACATT 
CGGACATTATGTGTGCACCTTGGTTTGTTATTGGGAGTTGGTGGTCAGATGCAGGAGCTTCGGAGGGTTCGTTCTGG 
AGTCATGAGTGAAAAGGACCACATGGTGACAATGCATGATGTGCTTGATGCTCAGTGGCTGTATGATAACCACAAGG 
ATGAGAGTTACCTGCGGCGAGTTGTTTACCCTTTGGAAAAGCTGTTGACATCTCATAAACGGCTGGTTATGAAAGAC 
AGTGCAGTAAATGCCATCTGCTATGGGGCCAAGATTATGCTTCCAGGTGTTCTTCGATATGAGGACGGCATTGAGGT 
CAATCAGGAGATTGTGGTTATCACCACCAAAGGAGAAGCAATCTGCATGGCTATTGCATTAATGACCACAGCGGTCA 
TCTCTACCTGCGACCATGGTATAGTAGCCAAGATCAAGAGAGTGATCATGGAGAGAGACACTTACCCTCGGAAGTGG 
GGTTTAGGTCCAAAGGCAAGTCAGAAGAAGCTGATGATCAAGCAGGGCCTTCTGGACAAGCATGGGAAGCCCACAGA 
CAGCACACCTGCCACCTGGAAGCAGGAGTATGTTGACTACAGTGAGTCTGCCAAAAAAGAGGTGGTTGCTGAAGTGG 
TAAAAGCCCCGCAGGTAGTTGCCGAAGCAGCAAAAACTGCGAAGCGGAAGCGAGAGAGTGAGAGTGAAAGTGACGAG 
ACTCCTCCAGCAGCTCCTCAGTTGATCAAGAAGGAAAAGAAGAAGAGTAAGAAGGACAAGAAGGCCAAAGCTGGTCT 
GGAGAGCGGGGCCGAGCCTGGAGATGGGGACAGTGATACCACCAAGAAGAAGAAGAAGAAGAAGAAAGCAAAAGAGG 
TAGAATTGGTTTCTGAGTAGTGAAGGCCACTTGAAGCTGGAGGAGAAACTAAAGCCTTATTGAGAAAACATGTTATA 
GATCCTTTTGTTGCTGAGAGAGTGGAACATAGGTCCTAGACAGGGTGAAGAGTTCTGGCACATTTTAGCTGCTACTT 
TGAGACCTCGGTGATGTTACCTGGTGTGGTCATCCCATCTTGTCCTGTTTTAAGGATATGGGTGGTGAAAGATGAAA 
GAGGCAGAGTTTATCCCAATGACTTCTCTGTTTGAGTTGGGAAGCCTCACCTTCAGACCCAGTAACTGTCCGCAGCT 
GTCTGCTAGTGGTTGTCTTAACATCGTAGTCCTAGTTTGCATTTTTTAAATCCCCTCTGTTTAAAAGGTTTGTAAAA 
CAAAAACAAAAAACTAAGTCTGCTCAGTGAAATGCTGTAGAACCCTAAATAAGTGGTAGAAGAGTGTCACTGAATTT 
TGTCTCTGAATTCAGTATAACTGAGTTTTGTCCATGCTGGTGTCTGGGTTATAGGCCTGATGGGCCTGGTAGTTTTC 
CATCTTGTTCTGGCCTAGAGGTCAGTCCTTTGCACTTCCTCAAAGCTTGTGTACAGTGCTCACCTAAATCCATCTGA 
CTACTTGTTCCTGTGCCCTCTTGTTTTAGGCCTCGTTTACTTTTAAAAAATGAAATTGTTCATTGCTGGGAGAAGAA 
TGTTGTAATTTTTACTTATTAAAGTCAACTTGTTAAGTTTTTTATGTATTCCTGTTGGGTTTTCTTGTTGATCTCAT 
GCTAGCAGAGCAAAAATTGTAAAATATTTTGATTAAAAATCTAGGGACCTTTATGTCCTATTTGGAATTCGATATCAA 
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GGCACGAGGCTACAAAGCAGGAAAGTATGCTTGGGAGAGGCCAAGTGAGTGGGGAATCAGCCCAAAGCCAGGCGTCC 
AGGGTCTCCCTCACCTGAAGCTGACTTTTTCCCCACCTTGGACAGAGGGCGGGAGATGCCATCCCCACTGAACCCAG 
TGCTTTCACCAGCCATATTAGCTCCCACTCACCCCCCGTCGTGGAAGCCTCGGCCGTCACACCTGCAGGGCCGGGGC 
GTGCATGGCCTCAGGGATGGCCTGTTCAGCTGCTGGGTGACTCGGGTCCAGGTGCCTCACCACCTGCTGAGCTCTGT 
GTGATTTCTGGACGCTTCTGCTCGTTGCCTTTGGGCTCAGTGAAGAGTCTGGAGTTTATCTGGAGTGAGGTGGCCGG 
TTCTTGGTGGGATCTGAGCAGGACAGCGTCTGGCTCCTTCCCTCGGCTC ATG GCCCTCAGAATCTGCGTCACATACA 
CCCCAGCTCTCCCGATAGGTCTCTGCACTCGCTGTTGCCTCTGCCTGGAACAGTCTCCCTCCTGGTGTCATTGTCTC 
CGTGGTGTGTCCTTCCTGACCTTCCACCTCCACCAGTCTGTCCCCCTTGGGGACAGGGACTCGTTGCTCATGTTCAC 
CCGGCAGGCTGGACACTTCGTGGAGGGCTCCAAAGCTGGCAGATCCCGGGGCCGCCTCTGTCTCTCCCAGGCCCTGC 
GTGTTGCGGTGAGAGGAGCATTTGTGTCTCTGTGGTTTGCTGCTGGAGCTGGTGACCGGGAGAGAAACAAGGGAGAC 
AAGGGTGCCCAGACAGGTGCGGGGCTCAGCCAGGAGGCAGAAGACGTGGACGTGTCCCGGGCCAGGAGGGTCACAGA 
TGCACCACAAGGCACTCTGTGTGGCACTGGGAACAGGAATTCTGGGAGTCAGTCTGCAAGGGCGGTGGGCGTTGCTC 
ACCTGGGAGAAGCCTTTAGAGTGGGCGTTGAGCAGGCCATTAGCTCGTGCCCTGAGGAGGTGCATGGGCGGCATGGG 
CTCTCCATGGAAATTATGTGGGCGCGAATGGATGTGGCTCTGCGCTCACCTGGGCGAGGACTTCTGGCCGGTGCCGG 
GGCACTCTGCGTGACCCTGGCAGAATCGAGCTGCCCTGACTATGAAAGGGGAAGAAGAGCATGCCTGACCCTCCACC 
GGCACCCCACCCCTCACTGCTCCACCTGGGGCCTGCCTCTGCGGGTGGCTGGGTCCTGGCTGACTGTTGTGACTGTT 
GAGGCCCTGGGGGGGTGGCGCATGGGAGTTAGGAGGACTGGCCAGGTGGGGCCCACTATGCACCCACCCCCAGTGTC 
AGGTGCTTCTCCTCTCCTCCTCCACCACCTCCTCCTCCTCCTCCTCATCATCATCCTCACTTGT TGA GGACGTCCTG 
TGTGCCAAGTGGTTTATATGCCCAGCCTCATTTAATCCTCAGAATGACTCCATGAGGTAGCTACTAAAACCCCCCAC 
TTAACAGATGAGGAAACTGAGGCCTAGAGAAGCTCAATAAGTTGCCTAAGTTCCAAGTTTCCTCTCCCAACTCTCCT 
ACCCTCTCCTCTTCCTTCTCCTTTCTCCCATTCTTCCCTGCCTCTTCCCTAACTAGACAATTTTTTATTGAGTGTCT 
CCCAGGTGCAGGCATGAGCCAGGTGCTGGGAAAATCATGATAACCCAGCTCCTTCTGGTCATTTTCTCAGCTGGTTA 
GAGGCTGGGAGGACACGCAAGTTCAGCTCCAGCCGACTGGGGCATTGGTGGTAGCCCCTGGAGACATTGTGCAATGG 
GGCTACGAGGCTGCATCTGGCTCCAGGGAAGCGTGTTGCAATCCATGAGTGATGTCTGCCATGCGTACAGGCATGGA 
GAGTGAGGCGCCTGTACTGTCTTTCTGTAGACCCTAGACTGGTGGGGCCTCTGAAATGCATCCAGACACTGTGCTGG 
GTGTGTTGCATGGCCCTCCCAACCAATTCAGTATTTTTCTCCCCATTTTCCAGGGAGAAATCTAAGGCGTCAGAATG 
TAAGGTTCTTATTGGAACCCAGGCTCCAGGGTCCCTGGTTTTCTGTGACATCATGCTGCAGGACCCTGTTTCTCTCT 
TTGCTTTGGCTGCTGGGGAGCTCAGAAGGGAGCTTTAGGCTTGCTCTCAGTCACCACATTAGCTCAGGGGTTTGGGC 
ATTTTTGTCGCCGTCTCCTTTGGGCTCTGTCTCCTCCCTGCTGTGCTTCCTGAGGAGCAGGCCGGATGTAAGTTATC 
AACTATAAATAAAACCAAGACTTCCGTTCTGGCTCTCAAAAAAAAAAAAAAAAAAA 
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GTACAGAAGCAAAATCAAACCTGCTATTTCAGCACTCCTGTTTTTAACTTGGTGTCTTTAGTGCTTGGATTGGTGGG 
ATGTTTCGGAATGGGCATTGTCGCCAATTTTCAGGAGTTAGCTGTGCCAGTGGTTCATGACGGGGGCGCTCTTTTGG 
CCTTTGTCTGTGGTGTCGTGTACACGCTCCTACAGTCCATCATCTCTTACAAATCATGTCCCCAGTGGAACAGTCTC 
TCGACATGCCACATACGGATGGTCATCTCTGCCGTTTCTTGCGCAGCTGTCATCCCCATGATTGTCTGTGCTTCACT 
AATTTCCATAACCAAGCTGGAGTGGAATCCAAGAGAAAAGGATTATGTATATCACGTAGTGAGTGCGATCTGTGAAT 
GGACAGTGGCCTTTGGTTTTATTTTCTACTTCCTAACTTTCATCCAAGATTTCCAGAGTGTCACCCTAAGGATATCC 
ACAGAAATCAATGGTGATATTTGAAGAAAGAAGAATTCAGTCTCACTCAGTGAATGTCGCAGGCCATTTCTAAAAGT 
GCTACAGAGGACAGACAGGGTTTTGAGGCCACCCTGATTATTGGGATGCATCTGCAGCACATCCAGGACTTGAATTT 
CATTACGAGTTCCTAATAGTTGTATTTCTAAAGATGTGTTTCCTAGAGAATGTACAGCCTTATGACACTGTAGTGAT 
GTTTTTATAATTTTCTAAGTAGATTTTTTTATATTAACAAATTCATATACAGAAAAAATAAGGTGTTACAAAAAATG 
GAGAGCTCTTATTTTTGTACAGATTCTGTCGTTTTTTTTTATTTGTGTGAGATTTATGGAAATACACTAAATGAGTA 
ATTCAGGTTCAGTACATTTATTACAAAGTGAAATCAGGGGATATTCATTTGTAAATTTTATTCTTAGTGAATGAACT 
GTATAATTTTTTTTATCAGGAGAGCACTTATAAAATTCAATTTATAAAGATCATATACCCAAATCATAAAGATTTAG 
TTGATACATTAACACTAAGATACTCTGATTTTTAGCGAACTAAACAAAGTGCTTCTACTGAGAGGCCTTTATACCAC 
CATGTACAGTAACTCTAAGTGAATACGGAAGACCTTGGTTTTGAAATTCTGCCACCTTGTTTCTCCCTGCTCATGAG 
GTCGCACCTTTTGCTCTTGCTGCTAATTGCCCATTCGTAGTGGGTGTAATGCCAGGTGGAATGGTTTCAACAAGTCA 
GGTGAAAACCATCCTTTATTGTTGCTGGCACAACTTGATATATAGTCTGACTCAGAACTGAAGCTCACATCTCAAAT 
TCATTTCATGCCAGTAAATGTGGCAAAGAGAAGAAAGGCCCAAGAGCGAGACAAGAAGAATGGAGAAGGGGGCAGCC 
AAGAAGAACTTCTGGGTTCAGGGTACTGTTTATTTGCTCCTTCTCTTCATGCCTGTGGCTGGATGTCCCACAACACT 
ATAAGAAATATAAGTCAAGCCCTTTGTGTTAAGCAAGAACTACAGACTCCATCTTTTCACCCAAATCATGAATGACC 
AATAAAAAGCAAGTTATTCCAGAGGAAGAAGCAGCCCTTGAAATGTTAAGGCTTAGGCTTGAAAGGTGAAGAGCAGG 
AATTCTCTCTTTCAAATCCTAGAGCATAAACCCATGTGTGGCCAAGTGAGATCAGCCCTCAAGGGCACATGCCAAGG 
GCAGAGCAGCCCATGTAGACAGCTTCGGAGGGCATGGGGGTGTAGGGAGTTCGGGGTAGCTCCTCATTAACTATTTG 
TTGGGTGAGTAAAGGGGTGAGGCTCAGTGGCAGGTACCTCTGCAATGACAAGCTGCCTCCCCTCTATGTGTTTAGCA 
TATGTTATTAGAACATGTCCGACACCCCTACCGCTGCCATTTGGGCCCTTTAATAAAGCCAAGTAGAGAAATCTGGC 
AATAAAAGGCAAATGTAAGCATGCTTTCTTTAAGACGCATCATAAATGGTTTTCTTTAAGTGAATGGAAGAGTTTGA 
CAGAGATACACCTTTGTAAGAAAACATTAAGAATGCTGGCTGGCTGTGGTGGCTCACACCTGTATTCCCAGCACTTT 
GGGAGGCCTAGGCAGGAGGATTGCTTGAGCCTGGGACTTCGAGACCAGACTGGGAAACATGGCAAAATCCCATCTCT 
ACAACAAAAATACAAAAATTAGCCAAGTGCGGTGGTGTGCCTGTAGTCCTAGTTACTTGGGAGGCTGAGGTGGGAGA 
ATCACCTGAGCCCAGGAGGTGGAGGCTGCAGTGAGCCATGCCAATGCACTCCAGTCTGGGCAACAGAGTGAGACCCT 
GTCTCAAAAATAAATAAATAAATAAATGAATAAAGAGAATGCTAATCATTTCTGGGTTCACTGCGACTCACTGTAGT 
GCTGGGGATCCCCCTTGTAACACTGGAACTGAAAGGCAGTGATGAAAGCTATGTCAAGCATTCATTATTCTGAAGAG 
GAGGAGAAATGCCACATACCTTTCCCATGGGACCTGTGGTGGAATGAATCCATACTTCTGCCTCACTTCGAGCAGAC 
TTTTGTTCTCGGCGCTCCTCACGATGGAGTTTCATGCTTCATTTTCACATCTCTCTGCACAATTAGATTGGGAGCTC 
CTTGAGGGCAGAGTACGTGCCTTAATCTTTATCTTTGTAATGCCACAATGAACAGAGTGCCTCCTGGTACACTGTAG 
GAGCTTAAGAAATACTCACTGAATGCATGAATGAATGAATGAACAAATGAAGGAATGACTAAGGATGTTTGTAGTGC 
TATAATATAGAATGGGATTTACTCTGCTTTACCAGTTAGTTTCATAATAAACAAATAGTCTGT 
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GGCACCGATTCGGGGCCTGCCCGGACTTCGCCGCACGCTGCAGAACCTCGCCCAGCGCCCACCATGCCCCGGCAGCT 
CAGCGCGGCGGCCGCGCTCTTCGCGTCCCTGGCCGTAATTTTGCACGATGGCAGTCAAATGAGAGCAAAAGCATTTC 
CAGAAACCAGAGATTATTCTCAACCTACTGCAGCAGCAACAGTACAGGACATAAAAAAACCTGTCCAGCAACCAGCT 
AAGCAAGCACCTCACCAAACTTTAGCAGCAAGATTCATGGATGGTCATATCACCTTTCAAACAGCGGCCACAGTAAA 
AATTCCAACAACTACCCCAGCAACTACAAAAAACACTGCAACCACCAGCCCAATTACCTACACCCTGGTCACAACCC 
AGGCCACACCCAACAACTCACACACAGCTCCTCCAGTTACTGAAGTTACAGTCGGCCCTAGCTTAGCCCCTTATTCA 
CTGCCACCCACCATCACCCCACCAGCTCATACAGCTGGAACCAGTTCATCAACCGTCAGCCACACAACTGGGAACAC 
CACTCAACCCAGTAACCAGACCACCCTTCCAGCAACTTTATCGATAGCACTGCACAAAAGCACAACCGGTCAGAAGC 
CTGATCAACCCACCCATGCCCCAGGAACAACGGCAGCTGCCCACAATACCACCCGCACAGCTGCACCTGCCTCCACG 
GTTCCTGGGCCCACCCTTGCACCTCAGCCATCGTCAGTCAAGACTGGAATTTATCAGGTTCTAAACGGAAGCAGACT 
CTGTATAAAAGCAGAGATGGGGATACAGCTGATTGTTCAAGACAAGGAGTCGGTTTTTTCACCTCGGAGATACTTCA 
ACATCGACCCCAACGCAACGCAAGCCTCTGGGAACTGTGGCACCCGAAAATCCAACCTTCTGTTGAATTTTCAGGGC 
GGATTTGTGAATCTCACATTTACCAAGGATGAAGAATCATATTATATCAGTGAAGTGGGAGCCTATTTGACCGTCTC 
AGATCCAGAGACAGTTTACCAAGGAATCAAACATGCGGTGGTGATGTTCCAGACAGCAGTCGGGCATTCCTTCAAGT 
GCGTGAGTGAACAGAGCCTCCAGTTGTCAGCCCACCTGCAGGTGAAAACAACCGATGTCCAACTTCAAGCCTTTGAT 
TTTGAAGATGACCACTTTGGAAATGTGGATGAGTGCTCGTCTGACTACACAATTGTGCTTCCTGTGATTGGGGCCAT 
CGTGGTTGGTCTCTGCCTTATGGGTATGGGTGTCTATAAAATCCGCCTAAGGTGTCAATCATCTGGATACCAGAGAA 
TCTAATTGTTGCCCGGGGGGAATGAAAATAATGGAATTTAGAGAACTCTTTCATCCCTTCCAGGATGGATGTTGGGA 
AATTCCCTCAGAGTGTGGGTCCTTCAAACAATGTAAACCACCATCTTCTATTCAAATGAAGTGAGTCATGTGTGATT 
TAAGTTCAGGCAGCACATCAATTTCTAAATACTTTTTGTTTATTTTATGAAAGATATAGTGAGCTGTTTATTTTCTA 
GTTTCCTTTAGAATATTTTAGCCACTCAAAGTCAACATTTGAGATATGTTGAATTAACATAATATATGTAAAGTAGA 
ATAAGCCTTCAAATTATAAACCAAGGGTCAATTGTAACTAATACTACTGTGTGTGCATTGAAGATTTTATTTTACCC 
TTGATCTTAACAAAGCCTTTGCTTTGTTATCAAATGGACTTTCAGTGCTTTTACTATCTGTGTTTTATGGTTTCATG 
TAACATACATATTCCTGGTGTAGCACTTAACTCCTTTTCCACTTTAAATTTGTTTTTGTTTTTTGAGACGGAGTTTC 
ACTCTTGTCACCCAGGCTGGAGTACAGTGGCACGATCTCGGCTTATGGCAACCTCCGCCTCCCGGGTTCAAGTGATT 
CTCCTGCTTCAGCTTCCCGAGTAGCTGGGATTACAGGCACACACTACCACGCCTGGCTAATTTTTGTATTTTTATTA 
TAGACGGGTTTCACCATGTTGGCCAGACTGGTCTTGAACTCTTGACCTCAGGTGATCCACCCACCTCAGCCTCCCAA 
AGTGCTGGGATTACAGGCATGAGCCATTGCGCCCGGCCTTAAATGTTTTTTTTAATCATCAAAAAGAACAACATATC 
TCAGGTTGTCTAAGTGTTTTTATGTAAAACCAACAAAAAGAACAAATCAGCTTATATTTTTTATCTTGATGACTCCT 
GCTCCAGAATTGCTAGACTAAGAATTAGGTGGCTACAGATGGTAGAACTAAACAATAAGCAAGAGACAATAATAATG 
GCCCTTAATTATTAACAAAGTGCCAGAGTCTAGGCTAAGCACTTTATCTATATCTCATTTCATTCTCACAACTTATA 
AGTGAATGAGTAAACTGAGACTTAAGGGAACTGAATCACTTAAATGTCACCTGGCTAACTGATGGCAGAGCCAGAGC 
TTGAATTCATGTTGGTCTGACATCAAGGTCTTTGGTCTTCTCCCTACACCAAGTTACCTACAAGAACAATGACACCA 
CACTCTGCCTGAAGGCTCACACCTCATACCAGCATACGCTCACCTTACAGGGAAATGGGTTTATCCAGGATCATGAG 
ACATTAGGGTAGATGAAAGGAGAGCTTTGCAGATAACAAAATAGCCTATCCTTAATAAATCCTCCACTCTCTGGAAG 
GAGACTGAGGGGCTTTGTAAAACATTAGTCAGTTGCTCATTTTTATGGGATTGCTTAGCTGGGCTGTAAAGATGAAG 
GCATCAAATAAACTCAAAGTATTTTTAAATTTTTTTGATAATAGAGAAACTTCGCTAACCAACTGTTCTTTCTTGAG 
TGTATAGCCCCATCTTGTGGTAACTTGCTGCTTCTGCACTTCATATCCATATTTCCTATTGTTCACTTTATTCTGTA 
GAGCAGCCTGCCAAGAATTTTATTTCTGCTGTTTTTTTTGCTGCTAAAGAAAGGAACTAAGTCAGGATGTTAACAGA 
AAAGTCCACATAACCCTAGAATTCTTAGTCAAGGAATAATTCAAGTCAGCCTAGAGACCATGTTGACTTTCCTCATG 
TGTTTCCTTATGACTCAGTAAGTTGGCAAGGTCCTGACTTTAGTCTTAATAAAACATTGAATTGTAGTAAAGGTTTT 
TGC AAT AAAAAC T T AC T T T GG 
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CAGAGGTAGCCTGAAAGAAGCAGGAACTCCAGGATCCCAAACCAGAGCAGACCCTATAGTAAAGTATTTTTACATCT 
TTTCCTTTCCCCAGAAGAGATCCCTAACCTATTGTTTTATTGACAGCCTTGCTGTTAGAGGCTCTTTCCCAGAAGTT 
GGACGAAGAGGCTCAGGCGTTGCTGTTTCTTGTCTTCCAAGTCAAGTGGTTACTCTGGTAATGGATTGCCTCTCTCC 
GAGCTTTCACCCTGGTGAGACTGTCCAGATCTAGTCTGTAAACCCAGCTTAGAAGCACTGTTGTAAAAATGACTGAA 
GAGCCCATCAAGGAGATCCTGGGAGCCCCAAAGGCTCACATGGCAGCGACGATGGAGAAGAGCCCCAAGAGTGAAGT 
TGTGATCACCACAGTCCCTCTGGTCAGTGAGATTCAGTTGATGGCTGCTACAGGGGGTACCGAGCTCTCCTGCTACC 
GCTGCATCATCCCCTTTGCTGTGGTTGTCTTCATCGCCGGCATCGTGGTCACCGCGGTGGCTTACAGCTTCAATTCC 
CATGGGTCTATTATCTCCATCTTTGGCCTGGTTGTTCTGTCATCTGGACTTTTTTTACTAGCCTCCAGTGCCTTGTG 
CTGGAAAGTGAGACAAAGGAGCAAGAAAGCCAAGAGACGGGAGAGTCAAACAGCTCTCGTGGCAAATCAGAGAAGCT 
TGTTTGCTTGAGACTGAATACGACCAAATGGGCCATTGGGCCTGGAAAACGTGCTCTGACTTTGTCACCCAATTCAC 
CCAGAACCATGGTGGGAGAGAACAGACTTGGCGTTGGAGCAGACTGGAAGAATGGGGGTGGGAGGGTGGAGGGGCTT 
CTCCTTTGTGAGGAATGACTCATGTCTTCTTTAACGACAAACTTAACCCTAAGGGCTACTTCTGAGACTGAAAAATC 
AGCTTTCTATTTACATGAAACACTTTGGGGGTCATGGGAGTGCACAGCATTAGACAGTATTTGGTTCACCCTGTAAA 
GTAGCCAAGAAAAGATGAGAAAAATCAAGATAGGCCTGGCACACTAGACATTTGCCTCCAAAAGAAATAACCTACAG 
TCTTAAGATGTATCATAAAAATGTTCTGCCAAGGATCTAAATTACCTTGGGTTTCGCATATGTCTATGAAATTCTGT 
GATAATTTTTTTCAATACATTGATTCACTGGCGTCTGTTTTCATTTTATACTTTTAATAACTCATCACTGGTGGTAC 
TTTATCTTGAAAAGTAATATTTTTTATATTTTAACATTGGACAGTGTTAGCCAGTTGTAATGATGTATCAGAAGTAA 
AGAAAAACCCATTAAAGTTATAGCTAATAGATGCTGTTGGGGGTTAAATTAATAGTAAAATAATCCAATATAGCACT 
TTTGATGATTTTTATATAAAAGTCAACTGTACATTTCATTCAGAATAATAAATACTTATTGCTGCTAAAACTTCTTA 
AATGGTTGTTTCTGCTATAGTTATTTCTATTGCAGTTCCAAATTGCCATCTTCCCTTGTCTCATTTGCAAGTTCTCA 
ATTGTATTTCTCTCAAATGGACAGGTTCCTTCTTTACTGGAGGATTTTTGTTTTTATCATATTGGTTTTTCATTACT 
TCTGAATAGTCTTAATTACGTTTACTAAATTCTAAAGGATTTCTGTGCTATTATAATTAGGAAATCAACGTCTTTGG 
TCAGGAACTTTATAATGTGCTATTAAATGTATATTACATTTTTGTGG 
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ATCCTGTCACTGCTCCACGCATCAACGCTGGCAGTCCTTGGGGCTCTGTGTGTATATGGTGCAGGTCACCTAGAGCA 

ACCTCAAATTTCCAGTACTAAAACGCTGTCAAAAACAGCCCGCCTGGAATGTGTGGTGTCTGGAATAACAATTTCTG 

CAACATCTGTATATTGGTATCGAGAGAGACCTGGTGAAGTCATACAGTTCCTGGTGTCCATTTCATATGACGGCACT 

GTCAGAAAGGAATCCGGCATTCCGTCAGGCAAATTTGAGGTGGATAGGATACCTGAAACGTCTACATCCACTCTCAC 

CATTCACAATGTAGAGAAACAGGACATAGCTACCTACTACTGTGCCTTGTGGGAGGTGCGGCTAGCCAACCAAGAGT 

TGGGCAAAAAAATCAAGGTATTTGGTCCCGGAACAAAGCTTATCATTACAGATAAACAACTTGATGCAGATGTTTCC 

CCCAAGCCCACTATTTTTCTTCCTTCAATTGCTGAAACAAAGCTCCAGAAGGCTGGAACATACCTTTGTCTTCTTGA 

GAAATTTTTCCCTGATGTTATTAAGATACATTGGCAAGAAAAGAAGAGCAACACGATTCTGGGATCCCAGGAGGGGA 

ACACCATGAAGACTAACGACACATACATGAAATTTAGCTGGTTAACGGTGCCAGAAAAGTCACTGGACAAAGAACAC 

AGATGTATCGTCAGACATGAGAATAATAAAAACGGAGTTGATCAAGAAATTATCTTTCCTCCAATAAAGACAGATGT 

CATCACAATGGATCCCAAAGACAATTGTTCAAAAGATGCAAATGATACACTACTGCTGCAGCTCACAAACACCTCTG • 

CATATTACACGTACCTCCTCCTGCTCCTCAAGAGTGTGGTCTATTTTGCCATCATCACCTGCTGTCTGCTTAGAAGA 

ACGGCTTTCTGCTGCAATGGAGAGAAATCATAACAGACGGTGGCACAAGGAGGCCATCTTTTCCTCATCGGTTATTG 

TCCCTAGAAGCGTCTTCTGAGGATCTAGTTGGGCTTTCTTTCTGGGTTTGGGCCATTTCAGTTCTCATGTGTGTACT 

ATTCTATCATTATTGTATAACGGTTTTCAAACCAGTGGGCACACAGAGAACCTCACTCTGTAATAACAATGAGGAAT 

AGCCACGGCGATCTCCAGCACCAATCTCTCCATGTTTTCCACAGCTCCTCCAGCCAACCCAAATAGCGCCTGCTATA 

GTGTAGACATCCTGCGGCTTCTAGCCTTGTCCCTCTCTTAGTGTTCTTTAATCAGATAACTGCCTGGAAGCCTTTCA 

TTTTACACGCCCTGAAGCAGTCTTCTTTGCTAGTTGAATTATGTGGTGTGTTTTTCCGTAATAAGCAAAATAAATTT 

AAAAAAATG 
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GGGACAGGGCTGAGGATGAGGAGAACCCTGGGGACCCAGAAGACCGTGCCTTGCCCGGAAGTCCTGCCTGTAGGCCT 
GAAGGACTTGCCCTAACAGAGCCTCAACAACTACCTGGTGATTCCTACTTCAGCCCCTTGGTGTGAGCAGCTTCTCA 
ACATGAACTACAGCCTCCACTTGGCCTTCGTGTGTCTGAGTCTCTTCACTGAGAGGATGTGCATCCAGGGGAGTCAG 
TTCAACGTCGAGGTCGGCAGAAGTGACAAGCTTTCCCTGCCTGGCTTTGAGAACCTCACAGCAGGATATAACAAATT 
TCTCAGGCCCAATTTTGGTGGAGAACCCGTACAGATAGCGCTGACTCTGGACATTGCAAGTATCTCTAGCATTTCAG 
AGAGTAACATGGACTACACAGCCACCATATACCTCCGACAGCGCTGGATGGACCAGCGGCTGGTGTTTGAAGGCAAC 
AAGAGCTTCACTCTGGATGCCCGCCTCGTGGAGTTCCTCTGGGTGCCAGATACTTACATTGTGGAGTCCAAGAAGTC 
CTTCCTCCATGAAGTCACTGTGGGAAACAGGCTCATCCGCCTCTTCTCCAATGGCACGGTCCTGTATGCCCTCAGAA 
TCACGACAACTGTTGCATGTAACATGGATCTGTCTAAATACCCCATGGACACACAGACATGCAAGTTGCAGCTGGAA 
AGCTGGGGCTATGATGGAAATGATGTGGAGTTCACCTGGCTGAGAGGGAACGACTCTGTGCGTGGACTGGAACACCT 
GCGGCTTGCTCAGTACACCATAGAGCGGTATTTCACCTTAGTCACCAGATCGCAGCAGGAGACAGGAAATTACACTA 
GATTGGTCTTACAGTTTGAGCTTCGGAGGAATGTTCTGTATTTCATTTTGGAAACCTACGTTCCTTCCACTTTCCTG 
GTGGTGTTGTCCTGGGTTTCATTTTGGATCTCTCTCGATTCAGTCCCTGCAAGAACCTGCATTGGAGTGACGACCGT 
GTTATCAATGACCACACTGATGATCGGGTCCCGCACTTCTCTTCCCAACACCAACTGCTTCATCAAGGCCATCGATG 
TGTACCTGGGGATCTGCTTTAGCTTTGTGTTTGGGGCCTTGCTAGAATATGCAGTTGCTCACTACAGTTCCTTACAG 
CAGATGGCAGCCAAAGATAGGGGGACAACAAAGGAAGTAGAAGAAGTCAGTATTACTAATATCATCAACAGCTCCAT 
CTCCAGCTTTAAACGGAAGATCAGCTTTGCCAGCATTGAAATTTCCAGCGACAACGTTGACTACAGTGACTTGACAA 
TGAAAACCAGCGACAAGTTCAAGTTTGTCTTCCGAGAAAAGATGGGCAGGATTGTTGATTATTTCACAATTCAAAAC 
CCCAGTAATGTTGATCACTATTCCAAACTACTGTTTCCTTTGATTTTTATGCTAGCCAATGTATTTTACTGGGCATA 
CTACATGTATTTTTGAGTCAATGTTAAATTTCTTGCATGCCATAGGTCTTCAACAGGACAAGATAATGATGTAAATG 
GTATTTTAGGCCAAGTGTGCACCCACATCCAATGGTGCTACAAGTGACTGAAATAATATTTGAGTCTTTCTGCTCAA 
AGAATGAAGCTCCAACCATTGTTCTAAGCTGTGTAGAAGTCCTAGCATTATAGGATCTTGTAATAGAAACATCAGTC 
CATTCCTCTTTCATCTTAATCAAGGACATTCCCATGGAGCCCAAGATTACAAATGTACTCAGGGCTGTTTATTCGGT 
GGCTCCCTGGTTTGCATTTACCTCATATAAAGAATGGGAAGGAGACCATTGGGTAACCCTCAAGTGTCAGAAGTTGT 
TTCTAAAGTAACTATACATGTTTTTTACTAAATCTCTGCAGTGCTTATAAAATACATTGTTGCCTATTTAGGGAGTA 
ACATTTTCTAGTTTTTGTTTCTGGTTAAAATGAAATATGGGCTTATGTCAATTCATTGGAAGTCAATGCACTAACTC 
AATACCAAGATGAGTTTTTAAATAATGAATATTATTTAATACCACAACAGAATTATCCCCAATTTCCAATAAGTCCT 
ATCATTGAAAATTCAAATATAAGTGAAGAAAAAATTAGTAGATCAACAATCTAAACAAATCCCTCGGTTCTAAGATA 
CAATGGATTCCCCATACTGGAAGGACTCTGAGGCTTTATTCCCCCACTATGCATATCTTATCATTTTATTATTATAC 
ACACATCCATCCTAAACTATACTAAAGCCCTTTTCCCATGCATGGATGGAAATGGAAGATTTTTTTGTAACTTGTTC 
TAGAAGTCTTAATATGGGCTGTTGCCATGAAGGCTTGCAGAATTGAGTCCATTTTCTAGCTGCCTTTATTCACATAG 
TGATGGGGTACTAAAAGTACTGGGTTGACTCAGAGAGTCGCTGTCATTCTGTCATTGCTGCTACTCTAACACTGAGC 
AACACTCTCCCAGTGGCAGATCCCCTGTATCATTCCAAGAGGAGCATTCATCCCTTTGCTCTAATGATCAGGAATGA 
TGCTTATTAGAAAACAAACTGCTTGACCCAGGAACAAGTGGCTTAGCTTAAGTAAACTTGGCTTTGCTCAGATCCCT 
GATCCTTCCAGCTGGTCTGCTCTGAGTGGCTTATCCCGCATGAGCAGGAGCGTGCTGGCCCTGAGTACTGAACTTTC 
TGAGTAACAATGAGACACGTTACAGAACCTATGTTCAGGTTGCGGGTGAGCTGCCCTCTCCAAATCCAGCCAGAGAT 
GCACATTCCTCGGCCAGTCTCAGCCAACAGTACCAAAAGTGATTTTTGAGTGTGCCAGGGTAAAGGCTTCCAGTTCA 
GCCTCAGTTATTTTAGACAATCTCGCCATCTTTAATTTCTTAGCTTCCTGTTCTAATAAATGCACGGCTTTACCTTT 
CCTGTCAGAAATAAACCAAGGCTCTAAAAGATGATTTCCCTTCTGTAACTCCCTAGAGCCACAGGTTCTCATTCCTT 
TTCCCATTATACTTCTCACAATTCAGTTTCTATGAGTTTGATCACCTGATTTTTTTAACAAAATATTTCTAACGGGA 
ATGGGTGGGAGTGCTGGTGAAAAGAGAT GAAATGTGGTTGTATGAGCCAATCATATTTGTGATTTTTTAAAAAAAGT 
TTAAAAGGAAATATCTGTTCTGAAACCCCACTTAAGCATTGTTTTTATATAAAAACAATGATAAAGATGTGAACTGT 
GAAATAAATATACCATATTAGCTACCCACCAAAAAAAAAAAAAAAAAA 
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TGCTGTCAGGGCGCTCAAGCTGCCGGGGAAGAAGAGCCCAGACCTAGGGGAGTATGATCCACTTACCCAGGCTGACAGTGATGAGA 
GCGAAGACGATCTGGTGCTTAACCTGCAGAAGAATGGAGGGGTCAAAAATGGGAAGAGTCCTTTGGGAGAAGCGCCAGAACCCGAC 
TCAGATGCTGAGGTTGCAGAGGCTGCAAAGCCACATCTTTCAGAAGTCACCACGGAGGGCTACCCCTCAGAACCCCTTGGGGGCCT 
GGAACAGAAGGCGGCCTCCTCCCTGGTGTCATATGTGCGCACGTCTGTCTTCCTGCTGACTTTGGGGATCTCG ATG ATCCTGGTGC 
TCCTGTGTGCTTTCCTGATCCCCTGTCCTCCCAGAGATCTGCACAGCACCTGGAGCCGCCACTTGGGCTCCCAGGGAGGTGGGGAC 
CTGTCTCCATTGGAATTGGCTGATGTGAATGGAGATGGCCTGCGTGATGTGCTTCTCTCCTTTGTGATGTCAAGGAACGGGAGTGC 
AGTAGGTGTCTCAAGACCAGCTGCTAATCTTGTGTGCCTTTCGGGGATGAATGGCAGCACACTGTGGTCTAGTCTTCTCCCTGAGG 
AGGCTCGAGATATCACATGTTTGGAGCTGATGCCAGGAAGCTTGGCTGAAACCATCTGCCTTGTGACAGGGACACACAAGATGCTC 
AGCGCATTCAATGCAACGTCAGGGAAAGCCATTTGGACTTTAAACCCAAACTACTTGTCCAACGGTACCTTGGCTGCCCCAGTTGT 
GGTACTGCCAGACTTGGATGAAGACGGTGTTCGAGACCTTGTGGTTCTGGCCATTGGGGAATTGCAGCCAGATCTGTGCTTTCTGC 
TGGTGTCTGGCCGGACCGGAAATCCAGTGGGTCGACCTGTGAAGTACAACATCGTTGGAGTTGGGAATCTGATTGGTCCTCAGGTT 
TACATCACCACAAATGGGGCTGTCTACATCCTGTTTGGCTTTGGAAATATACAAGCTGTCGCACTGCGGGACATTTTTGTTCAGGC 
CCAAAATCGAGACAGCTCACCACCTTCTCTGCAGATAGAAGAGCCAGAATGGGAAAAGCGAAGATCCATCAACCTGTCTGAGCTCA 
TTGATGTTTACAGTGATGGTGTTGAACTACTCCAGATGGTGAAGGCACCAGATTCCAACTGCAGCAACCTTCTGATTACAACCAGA 
CAAAGCCTTGTGCTGCTTCGGGGGCAAAATCTGACACCTTACTGGGCATTGAGACTTCAAGGCCTGCGCAGCCAGCCTACTCCTGG 
ATATTTCACTGATGATCAGACATTAGATTTCCTTCTGCAGATACAGGATGGAGTTGGGATGAAAAAGATGATGGTTGTGGATGGTG 
ACTCTGGCTCCATTGTTTGGAGTTACCGTGCTCCGTGTCACATGAAAGAAACGCCAGCCACCTCAGCAGTTACTTCAGACCAGAAG 
TCTGTCTTCCTCTTCTGGGCCGAAGGGCTGTCAGCTGCATCTCCCAATTCCGATATCATCCTAGGAACTGAGCCGCCCAGCCTTCA 
CCACCTTTACCTCCTGCATCCTGCGTTCCCCTCCATCCTTCTGGATCTGGCCAACACCACCGGCACAGTGACGGCTTCAGAGGTTG 
GAATTAACGACCTCTGGAAAGATGCCTTTTATGTTACCAGGACAACAGGGCCAAGCTCCGAAGGCCATCCAGCAGCCCTGGTGGTC 
AGCAAGCTTAGTCTACGGTGGGCACTAATGGAGGGCCAGATGGCTCAGCTACAGGAGTCCACCCCCAAAATTGGCCGTGGGGAGCT 
GCGAAGATTTCTCTCTAGGATAAAGTTTGTTGAAGCTCCCTACGAGATC TAA TCTGATGGAATCTTCAGTTGCAGAAGAAGTGAAC 
AGAGTGGATACCCTCTCTACTCTCCTGTCACTGTAAAATCAGTTCTATGGAGAGAAGACTTCTTCGTCCTCATTTACCACCTCCCT 
GATGGTTGCAAAGGCTTGGGAAGGCATGTTGGAGTCTTTGACGGCAGCATGATCTATTTGGCTGGGGCATCTTACCTACCTTTTCA 
GTCCCTGCATTAATCCCCTCTAGGAACTCTGCGTGGATCGTTTGGAAATGTGAATCTCTTAAGTATTTAATTTTTTTGGTATGTCT 
AATTTATGAAGTCTTGCTGGGAAAGCCAGTGAAGTCTATGACTAGGAAACATTTTGTTGTACATTGTGCTGTGTGTGTGTATATTT 
TAGTGTTGTGGTGAAGTTATTTTCCAGGTATGTCCTAAGCTTCAGGGATCCAGTTTCTTGTCCTTCTGAAATATATCTGGTTTGTT 
TGGTCATTTTGAGACTTCCAGATGCCCTACCTCTGATGTTGAGGGCCACTTATTTCTCTCCTTATTCTTTCCCACCTGTACCTTGG 
CTACTTCCAAATTGTAGACAGAATGAGAAAGATTTATAGTGGAAGACTGAGTTAGCCATCCAAGCATTTTCATCTCTCTTGTTTTA 
TATCCTATTTCCTTAGATTTTCCATCCATGTCTATTAAGTGACCACAAGAATAACTATATTCCTATCACAAGGGGAGCAAGAGGAT 
GTAGTCTCAGTGACCCATCTCTGACCAAGTCCACATGTTGTGTTATATGTGGCTCTGATGGTTCTGCCAGTCATGATCTTTTTTCT 
GTGGCGACATCAGAAGTGTATGTTTGCATGCTGTCTTCAACTTAGAGGAGAACTGGAAGTCAGGAGCCTTTGATGTCCTTATCCTG 
CTGTATGTCTTCTCTGCATCTTTTTCTATAGGGCACCCTCCTTAGCTCCCCTCACTCTGTTTTCTCTTCTATTCAGGGATATGTTT 
CTGGACTTTTTCTTCTGCTACTTGAGTCCAGGATGCAACCATTTTGTCCTGCATCTCTTCTTTCCTGTAGAGCCTTTGAAGCATTG 
TATTTTGGGAAAATTCTTCTGTAAATACTATAACTTTTATAAATGGTTAAGTTATTTAGAATTATCTCCAGTGCTTACTTCTCCCT 
TCTTCTGTATAAATCTGCTACTTCAATTAAGTTCTCCTCTAAACTTTTAGGTCATTGTTTATATAGCAGAAAATTCAATGTTAGCG 
GATGGAAAACTGCTTCTTGAATAACCTTGATAGGTCATCCCTGAGTGCACCTCAGGTTCTCTCTTTACCTGGGCTTGTATCTTTTT 
TTTTTTTTTTTTTTTTTTTTTGAGACAGAGTTTTGCTCTTGTCGCCCAGGCTGGAGTGCAGTGGCACAATCTCGGCTCACTGCAAC 
CTTCGCCTCCTGGGTTCAAGCGATTCTCCAGCCTTAGCCTCCCAAGTAGCTGGGACTACAGGTGCCCGCTACCATGCCTGGCTAAT 
TTTTTTTTTTGTATTTTTAGTAGAGACGGGGTTTCACCATGTTGGCCAGGCTGGTCACGAACTCCTGACCTCAGATAATCCACCTG 
CTTCTGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCATGCCCGGCTGGGCTTGTATCTTTTAGCTTGTGTTAGTAAAAGG 
ATTCTAGAAAATTATGAAGTCCAGATTCAAAGGGATCTCTGTTAATTACCCACTGACAGGCATTATGACCTAACAGGAGGTTGGTA 
GCAGTAGATCCAAGCATGCATGTTGCCTGGCCTGTAGATTGGCCTTATCAGGTTTCTGGGTGCCTCTGCCTTAAGATCCTGAAGGC 
AAATTTTGTTTCAACAGTTTGGAAGTCATCTGTGGGTCCAGCTTGACTTTGGAGGAATAAGAAGATACTTCTAGAGTATGGGAATG 
ATTCCAGATAATTTCTGGGATTTGAATCTACTTGAGTTTAAGGGCCTGGGACCTAATTTGGTTTAGTATAGAATTTGAAGAATTAA 
TTTATAGGCAGCTGAATACCCAAAACTTGGGTGGTGGTCCTGTGGTTTGGCTGAGCTGTCCGGGCATAACCTGGTTCTCTGTTATG 
TTAAGGCTTTCTGGGAAGCCAGCCACTCTGCGCAGGAGTGAAACATGAAGTTGTTTTCTGAGGACCTGTTTTGGTGGGATTGTTTG 
GGCAGAGGACTGTGTTTATGCAGGGCAAATCCCAGAAAGATAAGAGGAAGCTAGAGAAACTTAATGTACCTGAATTCTTCATGGTG 
TATTTGCAAACTAACTTAACATAGATTCTTTTGACTATGGTAAGTTTGAATCTCTCCTTGCCAAACAACATTATAAGTTTAGTTTT 
CTTCTTCCTCTTGCAGCCGGTACAGAAAGGTGTAAGTGGTGGCTGAAAATTGAGGAAGCTTCATCTGACCAATGTGGGTGCTGGTT 
TCTTGTGAAATGTGTCCCTAAGCCTCCTTCTCCTTGCAGGCAGCCACCCACCCAGGTGTCTAAGATAGGACATGCTCCTTTCTTTC 
TCTAATCCCATCCTGAGGTTGCCGGCAAAGCCAATATGACCACTACTGAGAAATAGTAATGACTTCTACAAATGCAAGGGTCTTAC 
CCTCCTCTTTCCCTTAAACACCCTCCCTTTTCCTTAGACCCCGTTTTTGCCATCCCCCAAATGTGTGGTATGGTGAAACTAATCCC 
CTGAATGTGAATTGCTATCCTTATTGCCCTATTAAAGAAGAGCCAGCTGGTATATTGTCAGGAAGCACTATTTAAAATGTGAACTG 
T T AT AG AGT AAAT AAAT AAAT AC T C T ACAGG 
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CGCTTAGAACTGTGTTGAGCTCTCACCCATCACGAJTGAGCAACAAATTCCTGGGCACCTGGAAACTTGTCTCTAGCG 
AGAACTTTGACGATTACATGAAAGCTCTGGGTGTGGGGTTAGCCACCAGAAAACTGGGAAATTTGGCCAAACCCACT 
GTGATCATCAGCAAGAAAGGAGATATTATAACTATACGAACTGAAAGTACCTTTAAAAATACAGAAATCTCCTTCAA 
GCTAGGCCAGGAATTTGAAGAAACCACAGCTGACAATAGAAAGACCAAGAGCATCGTAACCCTGCAGAGAGGATCAC 
TGAATCAAGTGCAGAGATGGGATGGCAAAGAGACAACCATAAAGAGAAAGCTAGTGAATGGGAAAATGGTAGCGGAA 
TGTAAAATGAAGGGCGTGGTGTGCACCAGAATCTATGAGAAGGTCTGAAAAATCATTTCTTCATTGAAGTGGCTTTT 
TATCATTTAATGATGGAAATCAATTGCTTCCATTGACAAAACTGAATACACTGCAAATATTTGTTTTTGCTTTTGTC 
TTAATATATCAGATATGCAAAGGCCTAAACTGAGAATTAATCTAAAAGTCAGTGTTATTTAAACATTTTCAATGTGC 
ATGCATGTCATTATTACATCAAAGCATATATATTGGCCAGACACAAACAGTTGATGATGTCATTCAATTAACTACAA 
AAT TCTAAT CTATGTT GAACT T T GT AT ACT T GAAAT G AT AAT AAAAAGGAT AT AATT T CTTAGTAAAAT GAAAT C AA 
AGTATTGATCAGGGTAGCAAACTCAAATGCTGACAGGGGCCAGAGGAGATATGGGGAAGGAGCATCAGAAATGAGGC 
AAGCTAGGAGAATGGGCTATTATAATGTAAAGAATTGTAGTCTCAGTTAAAAGGGGTAGCCTCTACTCCAGCCAACA 
TTTTAAAATTAATGGATAATTTATAGACAGTTAAATTTATAGACAGTTAAGTAAAAATGGATAATTTATAGACAGAT 
AATTTATAGACAGGTAAATGTGAGTTAAATATAACTCACATCCCACTCAAGACACAAAACATTTTCTTAATCCTAGT 
ACATTTTTTTCTGTCCCTTCCCAATCAGTGTCCTTTTCTGTTCCACCCCTACCAAAAGCAAGTAGTGGTTTGGTTTC 
TATCATATAGATTAATTTTACCTGCTCATATGAAGGGAATTGTACATCATGCATTCTTTTCTGTTTGCCTTTTTTAA 
ATTCAGCATCATGTTTTTGTGATACATCCACATTGTTGCATGCAGCTGTAGTTTGTTTCTTTTTATTACCAAGTACT 
ATTTCATTGTATGAATATATCACAGTTTATCCATTCTACTATTAAGACAATTGAGCTATTTCTAATTTTCGGCTGCT 
ATGAATAAAGCTGCTACAAATATTTTTGTACAAGACTTTTTGTAAACATAGGAGTCCCTTTATTTTAAATAAATAAC 
TAGTCATATCATTAGGTCCAGTAATTGTTGACAGGCAGGAACGGGGGACCATTGCATTGTGCCCAAGTAATAATAAA 
ACTATTTCAGATGTATTATATGATTGAGCAAATGAGAAAACATGTTGATGTTGATGGGAGTCAGGATGTTCACTATG 
GAAAAACAAATATACAAATATGAAATGAGGGAAGGCAAGAAAGAACCATGTGGAAATGGAATAGAATTGGTATAAAT 
TCATAATTTCTAAACCATGTATATGTACGTTTATATGTATTATAATTGCATACACATGCCTCCATGCATATATGTGT 
GTGATAATACACATGCATTTATGTGCGTGTGTGTATACACATGCATATATTTACTAATCCTATCTGCCAAAATGGCT 
TAGACACAAAAACACCTCAGCAGAAATGAATATACCTAGCACTCAGATCTTCGTGTCTAATATAGTTTGCCACTAAA 
AGGAACCAAGGCTACTTGGAAAAATGGATGATTCCAAAGCAAGGGCAAGGTAGGAACAAGATGAGCTTGAAATATCT 
TGTTATGCCAGAAAGTAATGTTAAAAAAAAAAAATAGAGGTATATTGCCAAAACATAGAGCCAGCTTGAAGGGGCTC 
CCACTGGCCAAACTTGAGCCAATCTGAGAGCAAAATAATTGAGAAAAATAAATAACAAGATAATTGAAAAAAAATAA 
AATAAATTCCCAATTCCATTGGGAAAAAATAGAAATCCATGAATCCATACTGATATAATGATAGATAATCGATAGAT 
AGGTAGATGACTGACAGATGATAGATAGATATATAGATAGATAGATAGATAGATAGATAGATAGATAGATAGATAGA 
GAAATGAGAAGGTTTCTCTTTTGCAATAGAACGTCATGGACCAGTGTTAAATGTGAGTGGAAGGAGTTCTGGAATTG 
GAAAACCATCATTTTTCAACCATCACAGTAAATATGGCTCAGGCAAGAATTATCAATCAATGCTAAAGCTAGGGGGA 
AATTTCGCTTAGGAGCAGGATATTAGGGTATTAGTCTGGGCTTAAAGTATCTCCTCACAGATTGTTGTTAGTTTCTG 
GGGAAAGAATAGTAACCATGCAATGGAAAAAAATGGACAACCTCTTGACTAGGTTATCAAAATTAACCTCACCAATA 
AAGGGTGGATGTTCAACATGTGCCTTCAAATGTGACCCACTGAGAAGGAAACAACATCACTGTAACAACAACAACCA 
GAAACGACAGGGGGTTTTGACTGAATTCTTCAAAAATGTCAATGTCATAGAAGACAAAGAAAGGTTGTGGAAATGTT 
TCAGATTAAATGATAGTAAAAACACCTGACAACTAAACATAGTAAGTAATACTAGACTGGATTCTGTACCAGAGGTA 
ACATAAGTGCTCCAAAGGACAATGTTAGGTCAACTGGCAAATTGGAATATAGACAGTCAATCAGATAAGAAGTATAC 
TTTGATTAAGTAAAAAAAATCCCTATTCTTGGAAAATACACAATAAAGTATTTTGAGGTAAAGGGCCATAATGTATG 
CAATCTACTCTCAAAAAATTCAGAAACATATATTTGTGTGCATTTGCATGTGCAACAGTACACACAAACATACATAA 
AGAGAGCAATTGATAAGGCAAATAAGGTAACATTTAACAATAATCTGATACACATAAATAGAGAAAGAGCAATTGAT 
AAAGTAAATGAGGTAAAATTTAACAATAATCTGAGCAAAAGGTATATGTGTTTTCTTTGAGACAGTCTGATTCTTGC 
AACTTATTCTGTAAGTTGGAACTTATTTCCAAACATGATTGAAAAAAAACCCCGCACTTGGCAACTTCTTCTCTTTT 
TCAGCCTAGAAATGTCTGTGTTAAGTGGTTTTTTATTTATTGTTGTTGTTTGTTGTTATTGTTGTTTTGTTGCCAGG 
CTCCAACTCACAAAATACGAGTTTAAAAACTGCGTTGTTATTTTTAGAGATTTGTGATAATACAACTTGTTATAAAA 
TTTATTCCTCAATAAATATAATTTCTCTACTATGCAAAAAAAAAAAAAAA 
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CAGCCCCGCGCGCCGGCCGAGTCGCTGAGCCGCGGCTGCCGGACGGGACGGGACCGGCTAGGCTGGGCGCGCCCCCC 
GGGCCCCGCCGTGGGC ATG GGCGCACTGGCCCGGGCGCTGCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCCG 
CCCCGGAGCTGGCCCCCGCGCCCTTCACGCTGCCCCTCCGGGTGGCCGCGGCCACGAACCGCGTAGTTGCGCCCACC 
CCGGGACCCGGGACCCCTGCCGAGCGCCACGCCGACGGCTTGGCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGC 
GGGCGCCGCCAACTTCTTGGCCATGGTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGA 
TCGGGACCCCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAGGAACCCCGCAC 
TCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCCAAGGGCTTTGACGTCACAGTGAAGTA 
CACACAAGGAAGCTGGACGGGCTTCGTTGGGGAAGACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTG 
TCAACATTGCCACTATTTTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATACTTGGCCTAGCT 
TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACACAAGCAAACATCCCCAA 
CGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGTTGCTGGATCTGGGACCAACGGAGGTAGTCTTGTCTTGG 
GTGGAATTGAACCAAGTTTGTATAAAGGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAA 
ATTCTGAAATTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCCATCGTGGA 
CAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTGGAAGCTGTGGCCCGCGCATCTCTGA 
TTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCCCAGCTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTAC 
TTCCCTAAAATCTCCATCTACCTGAGAGACGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTA 
CATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCATCCACAAATGCGCTGG 
TGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGACAGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGC 
CCCTGTGCAGAAATTGCAGGTGCTGCAGTGTCTGAAATTTCCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTG 
TGTCCCCGCTCAGTCTTTGAGCGAGCCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGAGCCATCC 
TCCTTGTCTTAATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCCCGTGACCCTGAGGTCGTCAATGAT 
GAGTCCTCTCTGGTCAGACATCGCTGGAA ATGAA TAGCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAG 
AAAATCACATTTCCAGGGCAGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTT 
CTGCTCCCAGATGCCTTCTAGATTCACTGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATCCTCCCTACTTCCAAG 
AAAAATAATTAAAAAAAAAACTTCATTCTAA 
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FIGURE 28 

GAGAGAATAGCTACAGATTCTCCATCCTCAGTCTTTGCAAGGCGACAGCTGTGCCAGCCGGGCTCTGGCAGGCTCCT 
GGCAGCATGGCAGTGAAGCTTGGGACCCTCCTGCTGGCCCTTGCCCTGGGCCTGGCCCAGCCAGCCTCTGCCCGCCG 
GAAGCTGCTGGTGTTTCTGCTGGATGGTTTTCGCTCAGACTACATCAGTGATGAGGCGCTGGAGTCATTGCCTGGTT 
TCAAAGAGATTGTGAGCAGGGGAGTAAAAGTGGATTACTTGACTCCAGACTTCCCTAGTCTCTCGTATCCCAATTAT 
TATACCCTAATGACTGGCCGCCATTGTGAAGTCCATCAGATGATCGGGAACTACATGTGGGACCCCACCACCAACAA 
GTCCTTTGACATTGGCGTCAACAAAGACAGCCTAATGCCTCTCTGGTGGAATGGATCAGAACCTCTGTGGGTCACTC 
TGACCAAGGCCAAAAGGAAGGTCTACATGTACTACTGGCCAGGCTGTGAGGTTGAGATTCTGGGTGTCAGACCCACC 
TACTGCCTAGAATATAAAAATGTCCCAACGGATATCAATTTTGCCAATGCAGTCAGCGATGCTCTTGACTCCTTCAA 
GAGTGGCCGGGCCGACCTGGCAGCCATATACCATGAGCGCATTGACGTGGAAGGCCACCACTACGGGCCTGCATCTC 
CGCAGAGGAAAGATGCCCTCAAGGCTGTAGACACTGTCCTGAAGTACATGACCAAGTGGATCCAGGAGCGGGGCCTG 
CAGGACCGCCTGAACGTCATTATTTTCTCGGATCACGGAATGACCGACATTTTCTGGATGGACAAAGTGATTGAGCT 
GAATAAGTACATCAGCCTGAATGACCTGCAGCAAGTGAAGGACCGCGGGCCTGTTGTGAGCCTTTGGCCGGCCCCTG 
GGAAACACTCTGAGATATATAACAAACTGAGCACAGTGGAACACATGACTGTCTACGAGAAAGAAGCCATCCCAAGC 
AGGTTCTATTACAAGAAAGGAAAGTTTGTCTCTCCTTTGACTTTAGTGGCTGATGAAGGCTGGTTCATAACTGAGAA 
TCGAGAGATGCTTCCGTTTTGGATGAACAGCACCGGCAGGCGGGAAGGTTGGCAGCGTGGATGGCACGGCTACGACA 
ACGAGCTCATGGACATGCGGGGCATCTTCCTGGCCTTCGGACCTGATTTCAAATCCAACTTCAGAGCTGCTCCTATC 
AGGTCGGTGGACGTCTACAATGTCATGTGCAATGTGGTGGGCATCACCCCGCTGCCCAACAACGGATCCTGGTCCAG 
GGTGATGTGCATGCTGAAGGGCCGCGCCGGCACTGCCCCGCCTGTCTGGCCCAGCCACTGTGCCCTGGCACTGATTC 
TTCTCTTCCTGCTTGCATAACTGATCATATTGCTTGTCTCAGAAAAAAACACCATCAGCAAAGTGGGCCTCCAAAGC 
CAGAT G AT T T T C AT T T T ATG T G T G AAT AAT AG C T T C ATT AAC ACAAT C AAGACCAT GC AC AT TGT AAAT AC AT TAT T 
CTTGGATAATTCTATACATAAAAGTTCCTACTTGTTAAA 



30/170 

FIGURE 29 



GCATTTTTGTCTGTGCTCCCTGATCTTCAGGTCACCACCATGAAGTTCTTAGCAGTCCTGGTACTCTTGGGAGTTTC 
CATCTTTCTGGTCTCTGCCCAGAATCCGACAACAGCTGCTCCAGCTGACACGTATCCAGCTACTGGTCCTGCTGATG 
ATGAAGCCCCTGATGCTGAAACCACTGCTGCTGCAACCACTGCGACCACTGCTGCTCCTACCACTGCAACCACCGCT 
GCTTCTACCACTGCTCGTAAAGACATTCCAGTTTTACCCAAATGGGTTGGGGATCTCCCGAATGGTAGAGTGTGTCC 
CTGAGATGGAATCAGCTTGAGTCTTCTGCAATTGGTCACAACTATTCATGCTTCCTGTGATTTCATCCAACTACTTA 
CCTTGCCTACGATATCCCCTTTATCTCTAATCAGTTTATTTTCTTTCAAATAAAAAATAACTATGAGCAACATAAAA 
AAAAAAAAA 
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FIGURE 30 

GACTACGGGGAGAGAGAGGAGACCAGGACAGCTGCTGAGACCTCTAAGAAGTCCAGATACTAAGAGCAAAGATGTTT 
CAAACTGGGGGCCTCATTGTCTTCTACGGGCTGTTAGCCCAGACCATGGCCCAGTTTGGAGGCCTGCCCGTGCCCCT 
GGACCAGACCCTGCCCTTGAATGTGAATCCAGCCCTGCCCTTGAGTCCCACAGGTCTTGCAGGAAGCTTGACAAATG 
CCCTCAGCAATGGCCTGCTGTCTGGGGGCCTGTTGGGCATTCTGGAAAACCTTCCGCTCCTGGACATCCTGAAGCCT 
GGAGGAGGTACTTCTGGTGGCCTCCTTGGGGGACTGCTTGGAAAAGTGACGTCAGTGATTCCTGGCCTGAACAACAT 
CATTGACATAAAGGTCACTGACCCCCAGCTGCTGGAACTTGGCCTTGTGCAGAGCCCTGATGGCCACCGTCTCTATG 
TCACCATCCCTCTCGGCATAAAGCTCCAAGTGAATACGCCCCTGGTCGGTGCAAGTCTGTTGAGGCTGGCTGTGAAG 
CTGGACATCACTGCAGAAATCTTAGCTGTGAGAGATAAGCAGGAGAGGATCCACCTGGTCCTTGGTGACTGCACCCA 
TTCCCCTGGAAGCCTGCAAATTTCTCTGCTTGATGGACTTGGCCCCCTCCCCATTCAAGGTCTTCTGGACAGCCTCA 
CAGGGATCTTGAATAAAGTCCTGCCTGAGTTGGTTCAGGGCAACGTGTGCCCTCTGGTCAATGAGGTTCTCAGAGGC 
TTGGACATCACCCTGGTGCATGACATTGTTAACATGCTGATCCACGGACTACAGTTTGTCATCAAGGTCTAAGCCTT 
CCAGGAAGGGGCTGGCCTCTGCTGAGCTGCTTCCCAGTGCTCACAGATGGCTGGCCCATGTGCTGGAAGATGACACA 
GTTGCCTTCTCTCCGAGGAACCTGCCCCCTCTCCTTTCCCACCAGGCGTGTGTAACATCCCATGTGCCTCACCTAAT 
AAAATGGCTCTTCTTATGCA 
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FIGURE 31 

GGGCGCTGGGAGACACCGGACGCCCGCTCGGCTGCGCTGCGGCTCAGGCCCCCGCTCGGGCCCGACCCGCTCGGTCA 
CCGCCGGCTCGGGCGCGCACCTGCCGGCTGCGGCCCCAGGGCCATGCGGAGGCCCACGAGGAGGCCGGCGGCCACGC 
GCATCCCGTAGCCCAGGTGGCCCAGGTCTGCACCGCGGCGGCCTCGGCGCC ATG GAGCCCCCGTATTCGCTGACGGC 
GCACTACGATGAGTTCCAAGAGGTCAAGTACGTGAGCCGCTGCGGCGCGGGGGGCGCGCGCGGGGCCTCCCTGCCCC 
CGGGCTTCCCGTTGGGCGCTGCGCGCAGCGTCACCGGGGCCCGGTCCGGGCTGCCGCGCTGGAACCGGCGCGAGGTG 
TGCCTGCTGTCGGGGCTGGTGTTCGCCGCCGGCCTCTGCGCCATTCTGGCGGCTATGCTGGCCCTCAAGTACCTGGG 
CCCGGTCGCGGCCGGCGGCGGCGCCTGTCCCGAGGGCTGCCCTGAGCGCAAGGCCTTCGCGCGCGCCGCTCGCTTCC 
TGGCCGCCAACCTGGACGCCAGCATCGACCCATGCCAGGACTTCTACTCGTTCGCCTGCGGCGGTTGGCTGCGGCGC 
CACGCCATCCCCGACGACAAGCTCACCTATGGCACCATCGCGGCCATCGGCGAGCAAAACGAGGAGCGCCTACGGCG 
CCTGCTGGCGCGGCCCGGGGGTGGGCCTGGCGGCGCGGCCCAGCGCAAGGTGCGCGCCTTCTTCCGCTCGTGCCTCG 
ACATGCGCGAGATCGAGCGACTGGGCCCGCGACCCATGCTAGAGGTCATCGAGGACTGCGGGGGCTGGGACCTGGGC 
GGCGCGGAGGAGCGTCCGGGGGTCGCGGCGCGATGGGACCTCAACCGGCTGCTGTACAAGGCGCAGGGCGTGTACAG 
CGCCGCCGCGCTCTTCTCGCTCACGGTCAGCCTGGACGACAGGAACTCCTCGCGCTACGTCATCCGCATTGACCAGG 
ATGGGCTCACCCTGCCAGAGAGGACCCTGTACCTCGCTCAGGATGAGGACAGTGAGAAGATCCTGGCAGCATACAGG 
GTGTTCATGGAGCGAGTGCTCAGCCTCCTGGGTGCAGACGCTGTGGAACAGAAGGCCCAAGAGATCCTGCAAGTGGA 
GCAGCAGCTGGCCAACATCACTGTGTCAGAGTATGACGACCTACGGCGAGATGTCAGCTCCATGTACAACAAGGTGA 
CGCTGGGGCAGCTGCAGAAGATCACCCCCCACTTGCGGTGGAAGTGGCTGCTAGACCAGATCTTCCAGGAGGACTTC 
TCAGAGGAAGAGGAGGTGGTGCTGCTGGCGACAGACTACATGCAGCAGGTGTCGCAGCTCATCCGCTCCACACCCCA 
CCGGGTCCTGCACAACTACCTGGTGTGGCGCGTGGTGGTGGTCCTGAGTGAACACCTGTCCCCGCCATTCCGTGAGG 
CACTGCACGAGCTGGCACAGGAGATGGAGGGCAGCGACAAGCCACAGGAGCTGGCCCGGGTCTGCTTGGGCCAGGCC 
AATCGCCACTTTGGCATGGCGCTTGGCGCCCTCTTTGTACATGAGCACTTCTCAGCCGCCAGCAAAGCCAAGGTGCA 
GCAGCTAGTGGAAGACATCAAGTACATCCTGGGCCAGCGCCTGGAGGAGCTGGACTGGATGGACGCCGAGACCAGGG 
CTGCTGCTCGGGCCAAGCTCCAGTACATGATGGTGATGGTCGGCTACCCGGACTTCCTGCTGAAACCCGATGCTGTG 
GACAAGGAGTATGAGTTTGAGGTCCATGAGAAGACCTACTTCAAGAACATCTTGAACAGCATCCCCTTCAGCATCCA 
GCTCTCAGTTAAGAAGATTCGGCAGGAGGTGGACAAGTCCACGTGGCTGCTCCCCCCACAGGCGCTCAATGCCTACT 
ATCTACCCAACAAGAACCAGATGGTGTTCCCCGCGGGCATCCTGCAGCCCACCCTGTACGACCCTGACTTCCCACAG 
TCTCTCAACTACGGGGGCATCGGCACCATCATTGGACATGAGCTGACCCACGGCTACGACGACTGGGGGGGCCAGTA 
TGACCGCTCAGGGAACCTGCTGCACTGGTGGACGGAGGCCTCCTACAGCCGCTTCCTGCGAAAGGCTGAGTGCATCG 
TCCGTCTCTATGACAACTTCACTGTCTACAACCAGCGGGTGAACGGGAAACACACGCTTGGGGAGAACATCGCAGAT 
ATGGGCGTCCTCAAGCTGGCCTACCACGCCTATCAGAAGTGGGTGCGGGAGCACGGCCCAGAGCACCCACTTCCCCG 
GCTCAAGTACACACATGACCAGCTCTTCTTCATTGCCTTTGCCCAGAACTGGTGCATCAAGCGGCGGTCGCAGTCCA 
TCTACCTGCAGGTGCTGACTGACAAGCATGCCCCTGAGCACTACAGGGTGCTGGGCAGTGTGTCCCAGTTTGAGGAG 
TTTGGCCGGGCTTTCCACTGTCCCAAGGACTCACCCATGAACCCTGCCCACAAGTGTTCCGTGTGG TGA GCCTGGCT 
GCCCGCCTGCACGCCCCCACTGCCCCCGCACGAATCACCTCCTGCTGGCTACCGGGGCAGGCATGCACCCGGTGCCA 
GCCCCGCTCTGGGCACCACCTGCCTTCCAGCCCCTCCAGGACCCGGTCCCCCTGCTGCCCCTCACTTCAGGAGGGGC 
CTGGAGCAGGGTGAGGCTGGACTTTGGGGGGCTGTGAGGGAAATATACTGGGGTCCCCAGATTCTGCTCTAAGGGGG 
CCAGACCCTCTGCCAGGCTGGATTGTACGGGCCCCACCTTCGCTGTGTTCTTGCTGCAAAGTCTGGTCAATAAATCA 
CTGCACTGTTAAAAAAAAA 
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GAGAGAACAGCGTGAGCCTGTGTGCTTGTGTGCTGAGCCCTCATCCCCTCCTGGGGCCAGGCTTGGGTTTCACCTGC 
AGAATCGCTTGTGCTGGGCTGCCTGGGCTGTCCTCAGTGGCACCTGCATGAAGCCGTTCTGGCTGCCAGAGCTGGAC 
AGCCCCAGGAAAACCCACCTCTCTGCAGAGCTTGCCCAGCTGTCCCCGGGAAGCCAAATGCCTCTCATGTAAGTCTT 
CTGCTCGACGGGGTGTCTCCTAAACCCTCACTCTTCAGCCTCTGTTTGACCATGAAATGAAGTGACTGAGCTCTATT 
CTGTACCTGCCACTCTATTTCTGGGGTGACTTTTGTCAGCTGCCCAGAATCTCCAAGCCAGGCTGGTTCTCTGCATC 
CTTTCAATGACCTGTTTTCTTCTGTAACCACAGGTTCGGTGGTGAGAGGAAGCCTCGCAGAATCCAGCAGAATCCTC 
ACAGAATCCAGCAGCAGCTCTGCTGGGGACATGGTCCATGGTGCAACCCACAGCAAAGCCCTGACCTGACCTCCTGA 
TGCTCAGGAGAAGCCATGGGCCCCTCCTGTCCTGTGTTCCTGTCCTTCACAAAGCTCAGCCTGTGGTGGCTCCTTCT 
GACCCCAGCAGGTGGAGAGGAAGCTAAGCGCCCACCTCCCAGGGCTCCTGGAGACCCACTCTCCTCTCCCAGTCCCA 
CGGCATTGCCGCAGGGAGGCTCGCATACCGAGACTGAGGACCGGCTCTTCAAACACCTCTTCCGGGGCTACAACCGC 
TGGGCGCGCCCGGTGCCCAACACTTCAGACGTGGTGATTGTGCGCTTTGGACTGTCCATCGCTCAGCTCATCGATGT 
GGATGAGAAGAACCAAATGATGACCACCAACGTCTGGCTAAAACAGGAGTGGAGCGACTACAAACTGCGCTGGAACC 
CCGCTGATTTTGGCAACATCACATCTCTCAGGGTCCCTTCTGAGATGATCTGGATCCCCGACATTGTTCTCTACAAC 
AATGCAGATGGGGAGTTTGCAGTGACCCACATGACCAAGGCCCACCTCTTCTCCACGGGCACTGTGCACTGGGTGCC 
CCCGGCCATCTACAAGAGCTCCTGCAGCATCGACGTCACCTTCTTCCCCTTCGACCAGCAGAACTGCAAGATGAAGT 
TTGGCTCCTGGACTTATGACAAGGCCAAGATCGACCTGGAGCAGATGGAGCAGACTGTGGACCTGAAGGACTACTGG 
GAGAGCGGCGAGTGGGCCATCGTCAATGCCACGGGCACCTACAACAGCAAGAAGTACGACTGCTGCGCCGAGATCTA 
CCCCGACGTCACCTACGCCTTCGTCATCCGGCGGCTGCCGCTCTTCTACACCATCAACCTCATCATCCCCTGCCTGC 
TCATCTCCTGCCTCACTGTGCTGGTCTTCTACCTGCCCTCCGACTGCGGCGAGAAGATCACGCTGTGCATTTCGGTG 
CTGCTGTCACTCACCGTCTTCCTGCTGCTCATCACTGAGATCATCCCGTCCACCTCGCTGGTCATCCCGCTCATCGG 
CGAGTACCTGCTGTTCACCATGATCTTCGTCACCCTGTCCATCGTCATCACCGTCTTCGTGCTCAATGTGCACCACC 
GCTCCCCCAGCACCCACACCATGCCCCACTGGGTGCGGGGGGCCCTTCTGGGCTGTGTGCCCCGGTGGCTTCTGATG 
AACCGGCCCCCACCACCCGTGGAGCTCTGCCACCCCCTACGCCTGAAGCTCAGCCCCTCTTATCACTGGCTGGAGAG 
CAACGTGGATGCCGAGGAGAGGGAGGTGGTGGTGGAGGAGGAGGACAGATGGGCATGTGCAGGTCATGTGGCCCCCT 
CTGTGGGCACCCTCTGCAGCCACGGCCACCTGCACTCTGGGGCCTCAGGTCCCAAGGCTGAGGCTCTGCTGCAGGAG 
GGTGAGCTGCTGCTATCACCCCACATGCAGAAGGCACTGGAAGGTGTGCACTACATTGCCGACCACCTGCGGTCTGA 
GGATGCTGACTCTTCGGTGAAGGAGGACTGGAAGTATGTTGCCATGGTCATCGACAGGATCTTCCTCTGGCTGTTTA 
TCATCGTCTGCTTCCTGGGGACCATCGGCCTCTTTCTGCCTCCGTTCCTAGCTGGAATGATC TGA CTGCAGCTCCCT 
CGAGCTGGCTCCCAGGGCAAAGGGGAGGGTTCTTGGATGTGGAAGGGCTTTGAACAATGTTTAGATTTGGAGATGAG 
CCCAAAGTGCCAGGGAGAACAGCCAGGTGAGGTGGGAGGTTGGAGAGCCAGGTGAGGTCTCTCTAAGTCAGGCTGGG 
GTTGAAGTTTGGAGTCTGTCCGAGTTTGCAGGGTGCTGAGCTGTATGGTCCAGCAGGGGAGTAATAAGGGCTCTTCC 
GGAAGGGGAGGAAGCGGGAGGCAGGCCTGCACCTGATGTGGAGGTACAGGCAGATCTTCCCTACCGGGGAGGGATGG 
ATGGTTGGATACAGGTGGCTGGGCTATTCCATCCATCTGGAAGCACATTTGAGCCTCCAGGCTTCTCCTTGACGTCA 
TTCCTCTCCTTCCTTGCTGCAAAATGGCTCTGCACCAGCCGGCCCCCAGGAGGTCTGGCAGAGCTGAGAGCCATGGC 
CTGCAGGGGCTCCATATGTCCCTACGCGTGCAGCAGGCAAACAAGA 
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FIGURE 33 

CTCTGTCCTGGGACTTGGTGGTGCTACCCTTGGCCTCCCACAGTCCTGCCACCCTGCTGCCGCCACCATGCTGCCCC 
CTGGGACTGCGACCCTCTTGACTCTGCTCCTGGCAGCTGGCTCGCTGGGCCAGAAGCCTCAGAGGCCACGCCGGCCC 
GCATCCCCCATCAGCACCATCCAGCCCAAGGCCAATTTTGATGCTCAGCAGTTTGCAGGGACCTGGCTCCTTGTGGC 
TGTGGGCTCCGCTTGCCGTTTCCTGCAGGAGCAGGGCCACCGGGCCGAGGCCACCACACTGCATGTGGCTCCCCAGG 
GCACAGCCATGGCTGTCAGTACCTTCCGAAAGCTGGATGGGATCTGCTGGCAGGTGCGCCAGCTCTATGGAGACACA 
GGGGTCCTCGGCCGCTTCCTGCTTCAAGCCCGAGGCGCCCGAGGGGCTGTGAACGTGGTTGTCGCTGAGACTGACTA 
CCAGAGTTTCGCTGTCCTGTACCTGGAGCGGGCGGGGCAGCTGTCAGTGAAGCTCTACGCCCGCTCGCTCCCTGTGA 
GCGACTCGGTCCTGAGTGGGTTTGAGCAGCGGGTCCAGGAGGCCCACCTGACTGAGGACCAGATCTTCTACTTCCCC 
AAGTACGGCTTCTGCGAGGCTGCAGACCAGTTCCACGTCCTGGACGAAGTGAGGAGGTGAGGCCGGCACACAGCTCC 
AGTGCTGAGAAGTCAGTGCCCCGAGAGACGACCCCACCAGTGGGGTGCCCGCTGCCTGTCCTCCGTGAAACCAGCCT 
CAGATCAGGGCCCTGCCACCCAGGGCAGGGGATCTTCTGCCGGCTGCCCCAGAGGACAGTGGGTGGAGTGGTACCTA 
CTTATTAAATGTCTCAGACCCCAAAAAAAAAAAAAAAAAA 
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FIGURE 34 

AAGAACAATTGTCTCTGGACGGCAGCTATGCGACTCACCGTGCTGTGTGCTGTGTGCCTGCTGCCTGGCAGCCTGGC 
CCTGCCGCTGCCTCAGGAGGCGGGAGGCATGAGTGAGCTACAGTGGGAACAGGCTCAGGACTATCTCAAGAGATTTT 
ATCTCTATGACTCAGAAACAAAAAATGCCAACAGTTTAGAAGCCAAACTCAAGGAGATGCAAAAATTCTTTGGCCTA 
CCTATAACTGGAATGTTAAACTCCCGCGTCATAGAAATAATGCAGAAGCCCAGATGTGGAGTGCCAGATGTTGCAGA 
ATACTCACTATTTCCAAATAGCCCAAAATGGACTTCCAAAGTGGTCACCTACAGGATCGTATCATATACTCGAGACT 
TACCGCATATTACAGTGGATCGATTAGTGTCAAAGGCTTTAAACATGTGGGGCAAAGAGATCCCCCTGCATTTCAGG 
AAAGTTGTATGGGGAACTGCTGACATCATGATTGGCTTTGCGCGAGGAGCTCATGGGGACTCCTACCCATTTGATGG 
GCCAGGAAACACGCTGGCTCATGCCTTTGCGCCTGGGACAGGTCTCGGAGGAGATGCTCACTTCGATGAGGATGAAC 
GCTGGACGGATGGTAGCAGTCTAGGGATTAACTTCCTGTATGCTGCAACTCATGAACTTGGCCATTCTTTGGGTATG 
GGACATTCCTCTGATCCTAATGCAGTGATGTATCCAACCTATGGAAATGGAGATCCCCAAAATTTTAAACTTTCCCA 
GGATGATATTAAAGGCATTCAGAAACTATATGGAAAGAGAAGTAATTCAAGAAAGAAA TAG AAACTTCAGGCAGAAC 
ATCCATTCATTCATTCATTGGATTGTATATCATTGTTGCACAATCAGAATTGATAAGCACTGTTCCTCCACTCCATT 
TAGCAATTATGTCACCCTTTTTTATTGCAGTTGGTTTTTGAATGTCTTTCACTCCTTTTATTGGTTAAACTCCTTTA 
TGGTGTGACTGTGTCTTATTCCATCTATGAGCTTTGTCAGTGCGCGTAGATGTCAATAAATGTTACATACACAAATA 
AATAAAATATTTAGGCCATGGTAAATTTACC 
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FIGURE 35 

GAGCGAGCACCTTCGACGCGGTCCGGGGACCCCCTCGTCGCTGTCCTCCCGACGCGGACCCGCGTGCCCCAGGCCTC 
GCGCTGCCCGGCCGGCTCCTCGTGTCCCACTCCCGGCGCACGCCCTCCCGCGCCCCTCTTCTCGGCGCGCGCGCAGC 
ATGGCGCCCCCGCAGGTCCTCGCGTTCGGGCTTCTGCTTGCCGCGGCGACGGCGACTTTTGCCGCAGCTCAGGAAGA 
ATGTGTCTGTGAAAACTACAAGCTGGCCGTAAACTGCTTTGTGAATAATAATCGTCAATGCCAGTGTACTTCAGTTG 
GTGCACAAAATACTGTCATTTGCTCAAAGCTGGCTGCCAAATGTTTGGTGATGAAGGCAGAAATGAATGGCTCAAAA 
CTTGGGAGAAGAGCAAAACCTGAAGGGGCCCTCCAGAACAATGATGGGCTTTATGATCCTGACTGCGATGAGAGCGG 
GCTCTTTAAGGCCAAGCAGTGCAACGGCACCTCCACGTGCTGGTGTGTGAACACTGCTGGGGTCAGAAGAACAGACA 
AGGACACTGAAATAACCTGCTCTGAGCGAGTGAGAACCTACTGGATCATCATTGAACTAAAACACAAAGCAAGAGAA 
AAACCTTATGATAGTAAAAGTTTGCGGACTGCACTTCAGAAGGAGATCACAACGCGTTATCAACTGGATCCAAAATT 
TATCACGAGTATTTTGTATGAGAATAATGTTATCACTATTGATCTGGTTCAAAATTCTTCTCAAAAAACTCAGAATG 
ATGTGGACATAGCTGATGTGGCTTATTATTTTGAAAAAGATGTTAAAGGTGAATCCTTGTTTCATTCTAAGAAAATG 
GACCTGACAGTAAATGGGGAACAACTGGATCTGGATCCTGGTCAAACTTTAATTTATTATGTTGATGAAAAAGCACC 
TGAATTCTCAATGCAGGGTCTAAAAGCTGGTGTTATTGCTGTTATTGTGGTTGTGGTGATGGCAGTTGTTGCTGGAA 
TTGTTGTGCTGGTTATTTCCAGAAAGAAGAGAATGGCAAAGTATGAGAAGGCTGAGATAAAGGAGATGGGTGAGATG 
CATAGGGAACTCAATGCATAACTATATAATTTGAAGATTATAGAAGAAGGGAAATAGCAAATGGACACAAATTACAA 
ATGTGTGTGCGTGGGACGAAGACATCTTTGAAGGTCATGAGTTTGTTAGTTTAACATCATATATTTGTAATAGTGAA 
ACCTGTACTCAAAATATAAGCAGCTTGAAACTGGCTTTACCAATCTTGAAATTTGACCACAAGTGTCTTATATATGC 
AGATCTAATGTAAAATCCAGAACTTGGACTCCATCGTTAAAATTATTTATGTGTAACATTCAAATGTGTGCATTAAA 
TATGCTTCCACAGTAAAATCTGAAAAACTGATTTGTGATTGAAAGCTGCCTTTCTATTTACTTGAGTCTTGTACATA 
CAT AC TTTTTTAT GAGC T ATGAAAT AAAAC AT T T TAAACTG 
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FIGURE 36 

GCTCCGGGCTGAAGATTGCTTCTCTTCTCTCCTCCAAGGTCTAGTGACGGAGCCCGCGCGCGCGCCACC ATG CGGCA 
GAAGGCGGTATCCGTTTTCTTGTGCTACCTGCTGCTCTTCACTTGCAGTGGGGTGGAGGCAGGTAAGAAAAAGTGCT 
CGGAGAGCTCGGACAGCGGCTCCGGGTTCTGGAAGGCCCTGACCTTCATGGCCGTCGGAGGAGGACTCGCAGTCGCC 
GGGCTGCCCGCGCTGGGCTTCACCGGCGCCGGCATCGCGGCCAACTCGGTGGCTGCCTCGCTGATGAGCTGGTCTGC 
GATCCTGAATGGGGGCGGCGTGCCCGCCGGGGGGCTAGTGGCCACGCTGCAGAGCCTCGGGGCTGGTGGCAGCAGCG 
TCGTCATAGGTAATATTGGTGCCCTGATGCGGTACGCCACCCACAAGTATCTCGATAGTGAGGAGGATGAGGAG TAG 
CCAGCAGCTCCCAGAACCTCTTCTTCCTTCTTGGCCTAACTCTTCCAGTTAGGATCTAGAACTTTGCCTTTTTTTTT 
TTTTTTTTTTTTTTGAGATGGGTTCTCACTATATTGTCCAGGCTAGAGTGCAGTGGCTATTCACAGATGCGAACATA 
GTACACTGCAGCCTCCAACTCCTAGCCTCAAGTGATCCTCCTGTCTCAACCTCCCAAGTAGGATTACAAGCATGCGC 
CGACGATGCCCAGAATCCAGAACTTTGTCTATCACTCTCCCCAACAACCTAGATGTGAAAACAGAATAAACTTCACC 
CAGAAAA 
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FIGURE 37 

GCCGCCGAGGGCAGCCAGCCCCTCCCCTACCCGGAGCAGCCCGCTGGGGCCGTCCCGAGCGGCGACACACTAGGAGT 
CCCGGCCGGCCAGCCAGGGCAGCCGCGGTCCCGGGACTCGGCCGTGAGTGCTGCGGGACGGATGGTGGCGGCGGGAG 
CGCGGAGACCACGGCGGGCGCCGTGGAGCCGGGCGCCGTGCAGCCGGAGCTGCGCGCGGGGCATGCGGCTGCGCCCC 
GGCCCCTCGGCCCCCGGCCTCGGCCCCCGCGCTCCGGCCCCAGCCCCGGCCGCCGGCCCCCGCGGAGTGCAGCGACC 
GCGCCGCCGCTGAGGGAGGCGCCCCACC ATG CCGCGGGCCCCGGCGCCGCTGTACGCCTGCCTCCTGGGGCTCTGCG 
CGCTCCTGCCCCGGCTCGCAGGTCTCAACATATGCACTAGTGGAAGTGCCACCTCATGTGAAGAATGTCTGCTAATC 
CACCCAAAATGTGCCTGGTGCTCCAAAGAGGACTTCGGAAGCCCACGGTCCATCACCTCTCGGTGTGATCTGAGGGC 
AAACCTTGTCAAAAATGGCTGTGGAGGTGAGATAGAGAGCCCAGCCAGCAGCTTCCATGTCCTGAGGAGCCTGCCCC 
TCAGCAGCAAGGGTTCGGGCTCTGCAGGCTGGGACGTCATTCAGATGACACCACAGGAGATTGCCGTGAACCTCCGG 
CCCGGTGACAAGACCACCTTCCAGCTACAGGTTCGCCAGGTGGAGGACTATCCTGTGGACCTGTACTACCTGATGGA 
CCTCTCCCTGTCCATGAAGGATGACTTGGACAATATCCGGAGCCTGGGCACCAAACTCGCGGAGGAGATGAGGAAGC 
TCACCAGCAACTTCCGGTTGGGATTTGGGTCTTTTGTTGATAAGGACATCTCTCCTTTCTCCTACACGGCACCGAGG 
TACCAGACCAATCCGTGCATTGGTTACAAGTTGTTTCCAAATTGCGTCCCCTCCTTTGGGTTCCGCCATCTGCTGCC 
TCTCACAGACAGAGTGGACAGCTTCAATGAGGAAGTTCGGAAACAGAGGGTGTCCCGGAACCGAGATGCCCCTGAGG 
GGGGCTTTGATGCAGTACTCCAGGCAGCCGTCTGCAAGGAGAAGATTGGCTGGCGAAAGGATGCACTGCATTTGCTG 
GTGTTCACAACAGATGATGTGCCCCACATCGCATTGGATGGAAAATTGGGAGGCCTGGTGCAGCCACACGATGGCCA 
GTGCCACCTGAACGAGGCCAACGAGTACACAGCATCCAACCAGATGGACTATCCATCCCTTGCCTTGCTTGGAGAGA 
AATTGGCAGAGAACAACATCAACCTCATCTTTGCAGTGACAAAAAACCATTATATGCTGTACAAGAATTTTACAGCC 
CTGATACCTGGAACAACGGTGGAGATTTTAGATGGAGACTCCAAAAATATTATTCAACTGATTATTAATGCATACAA 
TAGTATCCGGTCTAAAGTGGAGTTGTCAGTCTGGGATCAGCCTGAGGATCTTAATCTCTTCTTTACTGCTACCTGCC 
AAGATGGGGTATCCTATCCTGGTCAGAGGAAGTGTGAGGGTCTGAAGATTGGGGACACGGCATCTTTTGAAGTATCA 
TTGGAGGCCCGAAGCTGTCCCAGCAGACACACGGAGCATGTGTTTGCCCTGCGGCCGGTGGGATTCCGGGACAGCCT 
GGAGGTGGGGGTCACCTACAACTGCACGTGCGGCTGCAGCGTGGGGCTGGAACCCAACAGCGCCAGGTGCAACGGGA 
GCGGGACCTATGTCTGCGGCCTGTGTGAGTGCAGCCCCGGCTACCTGGGCACCAGGTGCGAGTGCCAGGATGGGGAG 
AACCAGAGCGTGTACCAGAACCTGTGCCGGGAGGCAGAGGGCAAGCCACTGTGCAGCGGGCGTGGGGACTGCAGCTG 
CAACCAGTGCTCCTGCTTCGAGAGCGAGTTTGGCAAGATCTATGGGCCTTTCTGTGAGTGCGACAACTTCTCCTGTG 
CCAGGAACAAGGGAGTCCTCTGCTCAGGCCATGGCGAGTGTCACTGCGGGGAATGCAAGTGCCATGCAGGTTACATC 
GGGGACAACTGTAACTGCTCGACAGACATCAGCACATGCCGGGGCAGAGATGGCCAGATCTGCAGCGAGCGTGGGCA 
CTGTCTCTGTGGGCAGTGCCAATGCACGGAGCCGGGGGCCTTTGGGGAGATGTGTGAGAAGTGCCCCACCTGCCCGG 
ATGCATGCAGCACCAAGAGAGATTGCGTCGAGTGCCTGCTGCTCCACTCTGGGAAACCTGACAACCAGACCTGCCAC 
AGCCTATGCAGGGATGAGGTGATCACATGGGTGGACACCATCGTGAAAGATGACCAGGAGGCTGTGCTATGTTTCTA 
CAAAACCGCCAAGGACTGCGTCATGATGTTCACCTATGTGGAGCTCCCCAGTGGGAAGTCCAACCTGACCGTCCTCA 
GGGAGCCAGAGTGTGGAAACACCCCCAACGCCATGACCATCCTCCTGGCTGTGGTCGGTAGCATCCTCCTTGTTGGG 
CTTGCACTCCTGGCTATCTGGAAGCTGCTTGTCACCATCCACGACCGGAGGGAGTTTGCAAAGTTTCAGAGCGAGCG 
ATCCAGGGCCCGCTATGAAATGGCTTCAAATCCATTATACAGAAAGCCTATCTCCACGCACACTGTGGACTTCACCT 
TCAACAAGTTCAACAAATCCTACAATGGCACTGTGGACTGATGTTTCCTTCTCCGAGGGGCTGGAGCGGGGATCTGA 
TGAAAAGGTCAGACTGAAACGCCTTGCACGGCTGCTCGGCTTGATCACAGCTCCCTAGGTAGGCACCACAGAGAAGA 
CCTTCTAGTGAGCCTGGGCCAGGAGCCCACAGTGCTGTACAACAAGGGAAAGGTAGCCTGGCCATGTCACCTGGCTG 
CTAGCCAGAGCCATGCCAGGTTCGCGTCCCTAAGAGCTTGGGATAAAGCAAGGGGACCTTGGCGCTCTCAGCTTTCC 
CTGCCACATCCAGCTTGTTGTCCCAATGAAATACTGAGATGCTGGGCTGTCTCTCCCTTCCAGGAATCGTGGGCCCC 
CAGCCTGGCCAGACAAGAAGACTGTCAGGAAGGGTCGGAGTCTGTAAAACCAGCATACAGTTTGGCTTTTTTCACAT 
TGATCATTTTTATATGAAATAAAAAGATCCTGCATTTATGGTGTAGTTCTGAGTCCTGAGACTTTTCTGCGTGATGC 
TATGCCTTGCACACAGGTGTTGGTGATGGGGCTGTTGAGATGCCTGTTGAAGGTACATCGTTTGCAAATGTCAGTTT 
CCTCTCCTGTCCGTGTTTGTTTAGTACTTTTATAATGAAAAGAAACAAGATTGTTTGGGATTGGAAGTAAAGATTAA 
AACCAAAAGAATTTGTGTTTGTCTGCCC 



39/170 



FIGURE 38 

CGACTCACTATAGGGCGAATTGAATTTAGCGGCCGCGAATTCGCCCTT ATG CTGCCACAAATACCCTTTTTGCTGCT 
AGTATCCTTGAACTTGGTTCATGGAGTGTTTTACGCTGAACGATACCAAATGCCCACAGGCATAAAAGGCCCACTAC 
CCAACACCAAGACACAGTTCTTCATTCCCTACACCATAAAGAGTAAAGGTATAGCAGTAAGAGGAGAGCAAGGTACT 
CCTGGTCCACCAGGCCCTGCTGGACCTCGAGGGCACCCAGGTCCTTCTGGACCACCAGGAAAACCAGGCTACGGAAG 
TCCTGGACTCCAAGGAGAGCCAGGGTTGCCAGGACCACCGGGACCATCAGCTGTAGGGAAACCAGGTGTGCCAGGAC 
TCCCAGGAAAACCAGGAGAGAGAGGACCATATGGACCAAAAGGAGATGTTGGACCAGCTGGCCTACCAGGACCCCGG 
GGCCCACCAGGACCACCTGGAATCCCTGGACCGGCTGGAATTTCTGTGCCAGGAAAACCTGGACAACAGGGACCCAC 
AGGAGCCCCAGGACCCAGGGGCTTTCCTGGAGAAAAGGGTGCACCAGGAGTCCCTGGTATGAATGGACAGAAAGGGG 
AAATGGGATATGGTGCTCCTGGTCGTCCAGGTGAGAGGGGTCTTCCAGGCCCTCAGGGTCCCACAGGACCATCTGGC 
CCTCCTGGAGTGGGAAAAAGAGGTGAAAATGGGGTTCCAGGACAGCCAGGCATCAAAGGTGATAGAGGTTTTCCGGG 
AGAAATGGGACCAATTGGCCCACCAGGTCCCCAAGGCCCTCCTGGGGAACGAGGGCCAGAAGGCATTGGAAAGCCAG 
GAGCTGCTGGAGCCCCAGGCCAGCCAGGGATTCCAGGAACAAAAGGTCTCCCTGGGGCTCCAGGAATAGCTGGGCCC 
CCAGGGCCTCCTGGCTTTGGGAAACCAGGCTTGCCAGGCCTGAAGGGAGAAAGAGGACCTGCTGGCCTTCCTGGGGG 
TCCAGGTGCCAAAGGGGAACAAGGGCCAGCAGGTCTTCCTGGGAAGCCAGGTCTGACTGGACCCCCTGGGAATATGG 
GACCCCAAGGACCAAAAGGCATCCCGGGTAGCCATGGTCTCCCAGGCCCTAAAGGTGAGACAGGGCCAGCTGGGCCT 
GCAGGATACCCTGGGGCTAAGGGTGAAAGGGGTTCCCCTGGGTCAGATGGAAAACCAGGGTACCCAGGAAAACCAGG 
TCTCGATGGTCCTAAGGGTAACCCAGGGTTACCAGGTCCAAAAGGTGATCCTGGAGTTGGAGGACCTCCTGGTCTCC 
CAGGCCCTGTGGGCCCAGCAGGAGCAAAGGGAATGCCCGGACACAATGGAGAGGCTGGCCCAAGAGGTGCCCCTGGA 
ATACCAGGTACTAGAGGCCCTATTGGGCCACCAGGCATTCCAGGATTCCCTGGGTCTAAAGGGGATCCAGGAAGTCC 
CGGTCCTCCTGGCCCAGCTGGCATAGCAACTAAGGGCCTCAATGGACCCACCGGGCCACCAGGGCCTCCAGGTCCAA 
GAGGCCACTCTGGAGAGCCTGGTCTTCCAGGGCCCCCTGGGCCTCCAGGCCCACCAGGTCAAGCAGTCATGCCTGAG 
GGTTTTATAAAGGCAGGCCAAAGGCCCAGTCTTTCTGGGACCCCTCTTGTTAGTGCCAACCAGGGGGTAACAGGAAT 
GCCTGTGTCTGCTTTTACTGTTATTCTCTCCAAAGCTTACCCAGCAATAGGAACTCCCATACCATTTGATAAAATTT 
TGTATAACAGGCAACAGCATTATGACCCAAGGACTGGAATCTTTACTTGTCAGATACCAGGAATATACTATTTTTCA 
TACCACGTGCATGTGAAAGGGACTCATGTTTGGGTAGGCCTGTATAAGAATGGCACCCCTGTAATGTACACCTATGA 
TGAATACACCAAAGGCTACCTGGATCAGGCTTCAGGGAGTGCCATCATCGATCTCACAGAAAATGACCAGGTGTGGC 
TCCAGCTTCCCAATGCCGAGTCAAATGGCCTATACTCCTCTGAGTATGTCCACTCCTCTTTCTCAGGATTCCTAGTG 
GCTCCAATGTGAGTACAAGGGCGAATTCGTTTAAACCTGCAGGACTAGTCCCTTTAGTGAGGGTTAATT 
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FIGURE 39 

GTTGGTGACCAAGAGTACATCTCTTTTCAAATAGCTGGATTAGGTCCTC ATG CTGCTGTGGTCATTGCTGGTCATCT 
TTGATGCAGTCACTGAACAGGCAGATTCGCTGACCCTTGTGGCGCCCTCTTCTGTCTTCGAAGGAGACAGCATCGTT 
CTGAAATGCCAGGGAGAACAGAACTGGAAAATTCAGAAGATGGCTTACCATAAGGATAACAAAGAGTTATCTGTTTT 
CAAAAAATTCTCAGATTTCCTTATCCAAAGTGCAGTTTTAAGTGACAGTGGTAACTATTTCTGTAGTACCAAAGGAC 
AACTCTTTCTCTGGGATAAAACTTCAAATATAGTAAAGATAAAAGTCCAAGAGCTCTTTCAACGTCCTGTGCTGACT 
GCCAGCTCCTTCCAGCCCATCGAAGGGGGTCCAGTGAGCCTGAAATGTGAGACCCGGCTCTCTCCACAGAGGTTGGA 
TGTTCAACTCCAGTTCTGCTTCTTCAGAGAAAACCAGGTCCTGGGGTCAGGCTGGAGCAGCTCTCCGGAGCTCCAGA 
TTTCTGCCGTGTGGAGTGAAGACACAGGGTCTTACTGGTGCAAGGCAGAAACGGTGACTCACAGGATCAGAAAACAG 
AGCCTCCAATCCCAGATTCACGTGCAGAGAATCCCCATCTCTAATGTAAGCTTGGAGATCCGGGCCCCCGGGGGACA 
GGTGACTGAAGGACAAAAACTGATCCTGCTCTGCTCAGTGGCTGGGGGTACAGGAAATGTCACATTCTCCTGGTACA 
GAGAGGCCACAGGAACCAGTATGGGAAAGAAAACCCAGCGTTCCCTGTCAGCAGAGCTGGAGATCCCAGCTGTGAAA 
GAGAGTGATGCCGGCAAATATTACTGTAGAGCTGACAACGGCCATGTGCCTATCCAGAGCAAGGTGGTGAATATCCC 
TGTGAGAATTCCAGTGTCTCGCCCTGTCCTCACCCTCAGGTCTCCTGGGGCCCAGGCTGCAGTGGGGGACCTGCTGG 
AGCTTCACTGTGAGGCCCTGAGAGGCTCTCCCCCAATCTTGTACCAATTTTATCATGAGGATGTCACCCTTGGGAAC 
AGCTCGGCCCCCTCTGGAGGAGGGGCCTCCTTCAACCTCTCTTTGACTGCAGAACATTCTGGAAACTACTCCTGTGA 
GGCCAACAACGGCCTGGGGGCCCAGTGCAGTGAGGCAGTGCCAGTCTCCATCTCAGGACCTGATGGCTATAGAAGAG 
ACCTCATGACAGCTGGAGTTCTCTGGGGACTGTTTGGTGTCCTTGGTTTCACTGGTGTTGCTTTGCTGTTGTATGCC 
TTGTTCCACAAGATATCAGGAGAAAGTTCTGCCACTAATGAACCCAGAGGGGCTTCCAGGCCAAATCCTCAAGAGTT 
CACCTATTCAAGCCCAACCCCAGACATGGAGGAGCTGCAGCCAGTGTATGTCAATGTGGGCTCTGTAGATGTGGATG 
TGGTTTATTCTCAGGTCTGGAGCATGCAGCAGCCAGAAAGCTCAGCAAACATCAGGACACTTCTGGAGAACAAGGAC 
TCCCAAGTCATCTACTCTTCTGTGAAGAAATCA TAA CACTTGGAGGAATCAGAAGGGAAGATCAACAGCAAGGATGG 
GGCATCATTAAGACTTGCTATAAAACCTTATGAAAATGCTTGAGGCTTATCACCTGCCACAGCCAGAACGTGCCTCA 
GGAGGCACCTCCTGTCATTTTTGTCCTGATGATGTTTCTTCTCCAATATCTTCTTTTACCTATCAATATTCATTGAA 
CTGCTGCTACATCCAGACACTGTGCAAATAAATTATTTCTGCTACCTTCAAAAAAAAAAAAAAAAAAAAATGCAG 
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GGCACGAGGCJTGCGCCAGGGCCTGAGCGGAGGCGGGGGCAGCCTCGCCAGCGGGGGCCCCGGGCCTGGCCATGCCTC 
ACTGAGCCAGCGCCTGCGCCTCTACCTCGCCGACAGCTGGAACCAGTGCGACCTAGTGGCTCTCACCTGCTTCCTCC 
TGGGCGTGGGCTGCCGGCTGACCCCGGGTTTGTACCACCTGGGCCGCACTGTCCTCTGCATCGACTTCATGGTTTTC 
ACGGTGCGGCTGCTTCACATCTTCACGGTCAACAAACAGCTGGGGCCCAAGATCGTCATCGTGAGCAAGATGATGAA 
GGACGTGTTCTTCTTCCTCTTCTTCCTCGGCGTGTGGCTGGTAGCCTATGGCGTGGCCACGGAGGGGCTCCTGAGGC 
CACGGGACAGTGACTTCCCAAGTATCCTGCGCCGCGTCTTCTACCGTCCCTACCTGCAGATCTTCGGGCAGATTCCC 
CAGGAGGACATGGACGTGGCCCTCATGGAGCACAGCAACTGCTCGTCGGAGCCCGGCTTCTGGGCACACCCTCCTGG 
GGCCCAGGCGGGCACCTGCGTCTCCCAGTATGCCAACTGGCTGGTGGTGCTGCTCCTCGTCATCTTCCTGCTCGTGG 
CCAACATCCTGCTGGTCAACTTGCTCATTGCCATGTTCAGTTACACATTCGGCAAAGTACAGGGCAACAGCGATCTC 
TACTGGAAGGCGCAGCGTTACCGCCTCATCCGGGAATTCCACTCTCGGCCCGCGCTGGCCCCGCCCTTTATCGTCAT 
CTCCCACTTGCGCCTCCTGCTCAGGCAATTGTGCAGGCGACCCCGGAGCCCCCAGCCGTCCTCCCCGGCCCTCGAGC 
ATTTCCGGGTTTACCTTTCTAAGGAAGCCGAGCGGAAGCTGCTAACGTGGGAATCGGTGCATAAGGAGAACTTTCTG 
CTGGCACGCGCTAGGGACAAGCGGGAGAGCGACTCCGAGCGTCTGAAGCGCACGTCCCAGAAGGTGGACTTGGCACT 
GAAACAGCTGGGACACATCCGCGAGTACGAACAGCGCCTGAAAGTGCTGGAGCGGGAGGTCCAGCAGTGTAGCCGCG 
TCCTGGGGTGGGTGGCCGAGGCCCTGAGCCGCTCTGCCTTGCTGCCCCCAGGTGGGCCGCCACCCCCTGACCTGCCT 
GGGTCCAAAGACTGAGCCCTGCTGGCGGACTTCAAGGAGAAGCCCCCACAGGGGATTTTGCTCCTAGAGTAAGGCTC 
ATCTGGGCCTCGGCCCCCGCACCTGGTGGCCTTGTCCTTGAGGTGAGCCCCATGTCCATCTGGGCCACTGTCAGGAC 
CACCTTTGGGAGTGTCATCCTTACAAACCACAGCATGCCCGGCTCCTCCCAGAACCAGTCCCAGCCTGGGAGGATCA 
AGGCCTGGATCCCGGGCCGTTATCCATCTGGAGGCTGCAGGGTCCTTGGGGTAACAGGGACCACAGACCCCTCACCA 
C T C AC AG AT T C C T C AC AC T G G G G AAAT AAAG CC AT T T C AG AG G AAAAAAAAAAAAAAAAAA 
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GAGAGAACAGCGTGAGCCTGTGTGCTTGTGTGCTGAGCCCTCATCCCCTCCTGGGGCCAGGCTTGGGTTTCACCTGC 
AGAATCGCTTGTGCTGGGCTGCCTGGGCTGTCCTCAGTGGCACCTGCATGAAGCCGTTCTGGCTGCCAGAGCTGGAC 
AGCCCCAGGAAAACCCACCTCTCTGCAGAGCTTGCCCAGCTGTCCCCGGGAAGCCAAATGCCTCTCATGTAAGTCTT 
CTGCTCGACGGGGTGTCTCCTAAACCCTCACTCTTCAGCCTCTGTTTGACCATGAAATGAAGTGACTGAGCTCTATT 
CTGTACCTGCCACTCTATTTCTGGGGTGACTTTTGTCAGCTGCCCAGAATCTCCAAGCCAGGCTGGTTCTCTGCATC 
CTTTCAATGACCTGTTTTCTTCTGTAACCACAGGTTCGGTGGTGAGAGGAAGCCTCGCAGAATCCAGCAGAATCCTC 
ACAGAATCCAGCAGCAGCTCTGCTGGGGACATGGTCCATGGTGCAACCCACAGCAAAGCCCTGACCTGACCTCCTGA 
TGCTCAGGAGAAGCCATGGGCCCCTCCTGTCCTGTGTTCCTGTCCTTCACAAAGCTCAGCCTGTGGTGGCTCCTTCT 
GACCCCAGCAGGTGGAGAGGAAGCTAAGCGCCCACCTCCCAGGGCTCCTGGAGACCCACTCTCCTCTCCCAGTCCCA 
CGGCATTGCCGCAGGGAGGCTCGCATACCGAGACTGAGGACCGGCTCTTCAAACACCTCTTCCGGGGCTACAACCGC 
TGGGCGCGCCCGGTGCCCAACACTTCAGACGTGGTGATTGTGCGCTTTGGACTGTCCATCGCTCAGCTCATCGATGT 
GGATGAGAAGAACCAAATGATGACCACCAACGTCTGGCTAAAACAGGAGTGGAGCGACTACAAACTGCGCTGGAACC 
CCGCTGATTTTGGCAACATCACATCTCTCAGGGTCCCTTCTGAGATGATCTGGATCCCCGACATTGTTCTCTACAAC 
AATGCAGATGGGGAGTTTGCAGTGACCCACATGACCAAGGCCCACCTCTTCTCCACGGGCACTGTGCACTGGGTGCC 
CCCGGCCATCTACAAGAGCTCCTGCAGCATCGACGTCACCTTCTTCCCCTTCGACCAGCAGAACTGCAAGATGAAGT 
TTGGCTCCTGGACTTATGACAAGGCCAAGATCGACCTGGAGCAGATGGAGCAGACTGTGGACCTGAAGGACTACTGG 
GAGAGCGGCGAGTGGGCCATCGTCAATGCCACGGGCACCTACAACAGCAAGAAGTACGACTGCTGCGCCGAGATCTA 
CCCCGACGTCACCTACGCCTTCGTCATCCGGCGGCTGCCGCTCTTCTACACCATCAACCTCATCATCCCCTGCCTGC 
TCATCTCCTGCCTCACTGTGCTGGTCTTCTACCTGCCCTCCGACTGCGGCGAGAAGATCACGCTGTGCATTTCGGTG 
CTGCTGTCACTCACCGTCTTCCTGCTGCTCATCACTGAGATCATCCCGTCCACCTCGCTGGTCATCCCGCTCATCGG 
CGAGTACCTGCTGTTCACCATGATCTTCGTCACCCTGTCCATCGTCATCACCGTCTTCGTGCTCAATGTGCACCACC 
GCTCCCCCAGCACCCACACCATGCCCCACTGGGTGCGGGGGGCCCTTCTGGGCTGTGTGCCCCGGTGGCTTCTGATG 
AACCGGCCCCCACCACCCGTGGAGCTCTGCCACCCCCTACGCCTGAAGCTCAGCCCCTCTTATCACTGGCTGGAGAG 
CAACGTGGATGCCGAGGAGAGGGAGGTGGTGGTGGAGGAGGAGGACAGATGGGCATGTGCAGGTCATGTGGCCCCCT 
CTGTGGGCACCCTCTGCAGCCACGGCCACCTGCACTCTGGGGCCTCAGGTCCCAAGGCTGAGGCTCTGCTGCAGGAG 
GGTGAGCTGCTGCTATCACCCCACATGCAGAAGGCACTGGAAGGTGTGCACTACATTGCCGACCACCTGCGGTCTGA 
GGATGCTGACTCTTCGGTGAAGGAGGACTGGAAGTATGTTGCCATGGTCATCGACAGGATCTTCCTCTGGCTGTTTA 
TCATCGTCTGCTTCCTGGGGACCATCGGCCTCTTTCTGCCTCCGTTCCTAGCTGGAATGATC TGA CTGCACCTCCCT 
CGAGCTGGCTCCCAGGGCAAAGGGGAGGGTTCTTGGATGTGGAAGGGCTTTGAACAATGTTTAGATTTGGAGATGAG 
CCCAAAGTGCCAGGGAGAACAGCCAGGTGAGGTGGGAGGTTGGAGAGCCAGGTGAGGTCTCTCTAAGTCAGGCTGGG 
GTTGAAGTTTGGAGTCTGTCCGAGTTTGCAGGGTGCTGAGCTGTATGGTCCAGCAGGGGAGTAATAAGGGCTCTTCC 
GGAAGGGGAGGAAGCGGGAGGCAGGCCTGCACCTGATGTGGAGGTACAGGCAGATCTTCCCTACCGGGGAGGGATGG 
ATGGTTGGATACAGGTGGCTGGGCTATTCCATCCATCTGGAAGCACATTTGAGCCTCCAGGCTTCTCCTTGACGTCA 
TTCCTCTCCTTCCTTGCTGCAAAATGGCTCTGCACCAGCCGGCCCCCAGGAGGTCTGGCAGAGCTGAGAGCCATGGC 
CTGCAGGGGCTCCATATGTCCCTACGCGTGCAGCAGGCAAACAAGA 
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CCGAGACTCACGGTCAAGCTAAGGCGAAGAGTGGGTGGCTGAAGCCATACTATTTTATAGAATTAATGGAAAGCAGA 
AAAGACATCACAAACCAAGAAGAACTTTGGAAAATGAAGCCTAGGAGAAATTTAGAAGAAGACGATTATTTGCATAA 
GGACACGGGAGAGACCAGCATGCTAAAAAGACCTGTGCTTTTGCATTTGCACCAAACAGCCCATGCTGATGAATTTG 
ACTGCCCTTCAGAACTTCAGCACACACAGGAACTCTTTCCACAGTGGCACTTGCCAATTAAAATAGCTGCTATTATA 
GCATCTCTGACTTTTCTTTACACTCTTCTGAGGGAAGTAATTCACCCTTTAGCAACTTCCCATCAACAATATTTTTA 
TAAAATTCCAATCCTGGTCATCAACAAAGTCTTGCCAATGGTTTCCATCACTCTCTTGGCATTGGTTTACCTGCCAG 
GTGTGATAGCAGCAATTGTCCAACTTCATAATGGAACCAAGTATAAGAAGTTTCCACATTGGTTGGATAAGTGGATG 
TTAACAAGAAAGCAGTTTGGGCTTCTCAGTTTCTTTTTTGCTGTACTGCATGCAATTTATAGTCTGTCTTACCCAAT 
GAGGCGATCCTACAGATACAAGTTGCTAAACTGGGCATATCAACAGGTCCAACAAAATAAAGAAGATGCCTGGATTG 
AGCATGATGTTTGGAGAATGGAGATTTATGTGTCTCTGGGAATTGTGGGATTGGCAATACTGGCTCTGTTGGCTGTG 
ACATCTATTCCATCTGTGAGTGACTCTTTGACATGGAGAGAATTTCACTATATTCAGAGCAAGCTAGGAATTGTTTC 
CCTTCTACTGGGCACAATACACGCATTGATTTTTGCCTGGAATAAGTGGATAGATATAAAACAATTTGTATGGTATA 
CACCTCCAACTTTTATGATAGCTGTTTTCCTTCCAATTGTTGTCCTGATATTTAAAAGCATACTATTCCTGCCATGC 
TTGAGGAAGAAGATACTGAAGATTAGACATGGTTGGGAAGACGTCACCAAAATTAACAAAACTGAGATATGTTCCCA 
GTTGTAGAATTACTGTTTACACACATTTTTGTTCAATATTGATATATTTTATCACCAACATTTCAAGTTTGTATTTG 
T T AAT AAAAT GAT T AT T C AAGG AAAAAAAAAAAAAAAAAA 
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GTGGCGCCCCCGGACGCCCACCCCGCAGTGGCCATGGAGAAGCCTTCGGGGCAGCCGTGGTGACCGTGTGGGACAGC 
GATGCACAACACCACGGAGAAGCCCACCGATGCCTACGGAGAGCTGGACTTCACGGGGGCCGGCCGCAAGCACAGCA 
ATTTCCTCCGGCTCTCTGACCGAACGGATCCAGCTGCAGTTTATAGTCTGGTCACACGCACATGGGGCTTCCGTGCC 
CCGAACCTGGTGGTGTCAGTGCTGGGGGGATCGGGGGGCCCCGTCCTCCAGACCTGGCTGCAGGACCTGCTGCGTCG 
TGGGCTGGTGCGGGCTGCCCAGAGCACAGGAGCCTGGATTGTCACTGGGGGTCTGCACACGGGCATCGGCCGGCATG 
TTGGTGTGGCTGTACGGGACCATCAGATGGCCAGCACTGGGGGCACCAAGGTGGTGGCCATGGGTGTGGCCCCCTGG 
GGTGTGGTCCGGAATAGAGACACCCTCATCAACCCCAAGGGCTCGTTCCCTGCGAGGTACCGGTGGCGCGGTGACCC 
GGAGGACGGGGTCCAGTTTCCCCTGGACTACAACTACTCGGCCTTCTTCCTGGTGGACGACGGCACACACGGCTGCC 
TGGGGGGCGAGAACCGCTTCCGCTTGCGCCTGGAGTCCTACATCTCACAGCAGAAGACGGGCGTGGGAGGGACTGGA 
ATTGACATCCCTGTCCTGCTCCTCCTGATTGATGGTGATGAGAAGATGTTGACGCGAATAGAGAACGCCACCCAGGC 
TCAGCTCCCATGTCTCCTCGTGGCTGGCTCAGGGGGAGCTGCGGACTGCCTGGCGGAGACCCTGGAAGACACTCTGG 
CCCCAGGGAGTGGGGGAGCCAGGCAAGGCGAAGCCCGAGATCGAATCAGGCGTTTCTTTCCCAAAGGGGACCTTGAG 
GTCCTGCAGGCCCAGGTGGAGAGGATTATGACCCGGAAGGAGCTCCTGACAGTCTATTCTTCTGAGGATGGGTCTGA 
GGAATTCGAGACCATAGTTTTGAAGGCCCTTGTGAAGGCCTGTGGGAGCTCGGAGGCCTCAGCCTACCTGGATGAGC 
TGCGTTTGGCTGTGGCTTGGAACCGCGTGGACATTGCCCAGAGTGAACTCTTTCGGGGGGACATCCAATGGCGGTCC 
TTCCATCTCGAAGCTTCCCTCATGGACGCCCTGCTGAATGACCGGCCTGAGTTCGTGCGCTTGCTCATTTCCCACGG 
CCTCAGCCTGGGCCACTTCCTGACCCCGATGCGCCTGGCCCAACTCTACAGCGCGGCGCCCTCCAACTCGCTCATCC 
GCAACCTTTTGGACCAGGCGTCCCACAGCGCAGGCACCAAAGCCCCAGCCCTAAAAGGGGGAGCTGCGGAGCTCCGG 
CCCCCTGACGTGGGGCATGTGCTGAGGATGCTGCTGGGGAAGATGTGCGCGCCGAGGTACCCCTCCGGGGGCGCCTG 
GGACCCTCACCCAGGCCAGGGCTTCGGGGAGAGCATGTATCTGCTCTCGGACAAGGCCACCTCGCCGCTCTCGCTGG 
ATGCTGGCCTCGGGCAGGCCCCCTGGAGCGACCTGCTTCTTTGGGCACTGTTGCTGAACAGGGCACAGATGGCCATG 
TACTTCTGGGAGATGGGTTCCAATGCAGTTTCCTCAGCTCTTGGGGCCTGTTTGCTGCTCCGGGTGATGGCACGCCT 
GGAGCCTGACGCTGAGGAGGCAGCACGGAGGAAAGACCTGGCGTTCAAGTTTGAGGGGATGGGCGTTGACCTCTTTG 
GCGAGTGCTATCGCAGCAGTGAGGTGAGGGCTGCCCGCCTCCTCCTCCGTCGCTGCCCGCTCTGGGGGGATGCCACT 
TGCCTCCAGCTGGCCATGCAAGCTGACGCCCGTGCCTTCTTTGCCCAGGATGGGGTACAGTCTCTGCTGACACAGAA 
GTGGTGGGGAGATATGGCCAGCACTACACCCATCTGGGCCCTGGTTCTCGCCTTCTTTTGCCCTCCACTCATCTACA 
CCCGCCTCATCACCTTCAGGAAATCAGAAGAGGAGCCCACACGGGAGGAGCTAGAGTTTGACATGGATAGTGTCATT 
AATGGGGAAGGGCCTGTCGGGACGGCGGACCCAGCCGAGAAGACGCCGCTGGGGGTCCCGCGCCAGTCGGGCCGTCC 
GGGTTGCTGCGGGGGCCGCTGCGGGGGGCGCCGGTGCCTACGCCGCTGGTTCCACTTCTGGGGCGCGCCGGTGACCA 
TCTTCATGGGCAACGTGGTCAGCTACCTGCTGTTCTTGCTGCTTTTCTCGCGGGTGCTGCTCGTGGATTTCCAGCCG 
GCGCCGCCCGGCTCCCTGGAGCTGCTGCTCTATTTCTGGGCTTTCACGCTGCTGTGCGAGGAACTGCGCCAGGGCCT 
GAGCGGAGGCGGGGGCAGCCTCGCCAGCGGGGGCCCCGGGCCTGGCCATGCCTCACTGAGCCAGCGCCTGCGCCTCT 
ACCTCGCCGACAGCTGGAACCAGTGCGACCTAGTGGCTCTCACCTGCTTCCTCCTGGGCGTGGGCTGCCGGCTGACC 
CCGGGTTTGTACCACCTGGGCCGCACTGTCCTCTGCATCGACTTCATGGTTTTCACGGTGCGGCTGCTTCACATCTT 
CACGGTCAACAAACAGCTGGGGCCCAAGATCGTCATCGTGAGCAAGATGATGAAGGACGTGTTCTTCTTCCTCTTCT 
TCCTCGGCGTGTGGCTGGTAGCCTATGGCGTGGCCACGGAGGGGCTCCTGAGGCCACGGGACAGTGACTTCCCAAGT 
ATCCTGCGCCGCGTCTTCTACCGTCCCTACCTGCAGATCTTCGGGCAGATTCCCCAGGAGGACATGGACGTGGCCCT 
CATGGAGCACAGCAACTGCTCGTCGGAGCCCGGCTTCTGGGCACACCCTCCTGGGGCCCAGGCGGGCACCTGCGTCT 
CCCAGTATGCCAACTGGCTGGTGGTGCTGCTCCTCGTCATCTTCCTGCTCGTGGCCAACATCCTGCTGGTCAACTTG 
CTCATTGCCATGTTCAGTTACACATTCGGCAAAGTACAGGGCAACAGCGATCTCTACTGGAAGGCGCAGCGTTACCG 
CCTCATCCGGGAATTCCACTCTCGGCCCGCGCTGGCCCCGCCCTTTATCGTCATCTCCCACTTGCGCCTCCTGCTCA 
GGCAATTGTGCAGGCGACCCCGGAGCCCCCAGCCGTCCTCCCCGGCCCTCGAGCATTTCCGGGTTTACCTTTCTAAG 
GAAGCCGAGCGGAAGCTGCTAACGTGGGAATCGGTGCATAAGGAGAACTTTCTGCTGGCACGCGCTAGGGACAAGCG 
GGAGAGCGACTCCGAGCGTCTGGAGCGCACGTCCCAGAAGGTGGACTTGGCACTGAAACAGCTGGGACACATCCGCG 
AGTACGAACAGCGCCTGAAAGTGCTGGAGCGGGAGGTCCAGCAGTGTAGCCGCGTCCTGGGGTGGGTGACGTAGGCC 
GTTAGCAGCTCTGCCATGTTGCCCTCAGGTGGGCCGCCACCCCTTGACCTGCATGGGTCCAAAGAGTGAGCCATGCT 
GGCGGATTTTAAGGAGAAGCCCCCACAGGGGATTTTGCTCTTAGAGTAAGGCTCATGTGGGCCTCGGCCCCCGCACC 
TGGTGGCCTTGTCCTTGAGGTGAGCCCCATGTCCATCTGGGCCACTGTCAGGACCACCTTTGGGAGTGTCATCCTTA 
CAAACCACAGCATGCCCGGCTCCTCCCAGAACCAGTCCCAGCCTGGGAGGATCAAGGCCTGGATCCCGGGCCGTTAT 
CCATCTGGAGGCTGCAGGGTCCTTGGGGTAACAGGGACCACAGACCCCTCACCACTCACAGATTCCTCACACTGGGG 
AAAT AAAGCC AT T T C AG AGG AAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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CGATAATTGAAAACCCAGATGTCCCACAGGATTTCGGGAATCAAGGGTCAACAGTAGAGTCCCTCTGTGATGATGTT 
GTCTCTGTACTCACCGTCCTGTGTGAGAAGCTGCAAGCCGCCATAAATGACAGCCAGCAGCTGCAGCTTCTCTACCT 
GGAGTGCATCCTGTCTGTGCTCAGCAGCTCCTCCTCCTCCATGCACCTGCACAGGCGCTTCACGGACCTGATCTGGA 
AAAACCTCTGCCCTGCTCTCATCGTGATCTTGGGGAATCCAATTCATGACAAAACCATCACCTCTGCTCACACCAGC 
AGCACCAGTACCAGCCTGGAGTCGGACTCTGCGTCTCCGGGAGTGTCTGACCACGGCCGAGGATCAGGCTGCTCCTG 
CACTGCGCCGGCCCTGAGCGGACCTGTGGCTCGGACTATCTATTACATCGCAGCCGAGCTGGTCCGGCTGGTGGGGT 
CTGTGGACTCCATGAAGCCCGTGCTCCAGTCCCTCTACCACCGAGTGCTGCTCTACCCCCCACCCCAGCACCGGGTG 
GAAGCCATCAAAATAATGAAAGAGATACTTGGGAGCCCACAGCGTCTCTGTGACTTGGCAGGACCCAGCTCCACTGA 
ATCAGAGTCCAGAAAAAGATCAATTTCAAAAAGAAAGTCTCATCTGGATCTCCTCAAACTCATCATGGATGGCATGA 
CCGAAGCATGCATCAAGGGTGGCATCGAAGCTTGCTATGCAGCCGTGTCCTGTGTCTGCACCTTGCTGGGTGCCCTG 
GATGAGCTCAGCCAGGGGAAGGGCTTGAGCGAAGGTCAGGTGCAACTGCTGCTTCTGCGCCTTGAGGAGCTGAAGGA 
TGGGGCTGAGTGGAGCCGAGATTCCATGGAGATCAATGAGGCTGACTTCCGCTGGCAGCGGCGAGTGCTGTCCTCAG 
AACACACGCCGTGGGAGTCAGGGAACGAGAGGAGCCTTGACATCAGCATCAGTGTCACCACAGACACAGGCCAGACC 
ACTCTCGAGGGAGAGTTGGGTCAGACTACACCCGAGGACCATTCGGGAAACCACAAGAACAGTCTCAAGTCGCCAGC 
CATCCCAGAGGGTAAGGAGACGCTGAGCAAAGTATTGGAAACAGAGGCGGTAGACCAGCCAGATGTCGTGCAGAGAA 
GCCACACGGTCCCTTACCCTGACATAACTAACTTCCTGTCAGTAGACTGCAGGACAAGGTCCTATGGATCTAGGTAT 
AGTGAGAGCAATTTTAGCGTTGATGACCAAGACCTTTCTAGGACAGAGTTTGATTCCTGTGATCAGTACTCTATGGC 
AGCAGAAAAGGACTCGGGCAGGTCCGACGTGTCAGACATTGGGTCGGACAACTGTTCACTAGCCGATGAAGAGCAGA 
CACCCCGGGACTGCCTAGGCCACCGGTCCCTGCGAACTGCCGCCCTGTCTCTAAAACTGCTGAAGAACCAGGAGGCG 
GATCAGCACAGCGCCAGGCTGTTCATACAGTCCCTGGAAGGCCTCCTCCCTCGGCTCCTGTCTCTCTCCAATGTAGA 
GGAGGTGGACACCGCTCTGCAGAACTTTGCCTCTACTTTCTGCTCAGGCATGATGCACTCTCCTGGCTTTGACGGGA 
ATAGCAGCCTCAGCTTCCAGATGCTGATGAACGCAGACAGCCTCTACACAGCTGCACACTGCGCCCTGCTCCTCAAC 
CTGAAGCTCTCCCACGGTGACTACTACAGGAAGCGGCCGACCCTGGCGCCAGGCGTGATGAAGGACTTCATGAAGCA 
GGTGCAGACCAGCGGCGTGCTGATGGTCTTCTCTCAGGCCTGGATTGAGGAGCTCTACCATCAGGTGCTCGACAGGA 
ACATGCTTGGAGAGGCTGGCTATTGGGGCAGCCCAGAAGATAACAGCCTTCCCCTCATCACAATGCTGACCGATATT 
GACGGCTTAGAGAGCAGTGCCATTGGTGGCCAGCTGATGGCCTCGGCTGCTACAGAGTCTCCTTTCGCCCAGAGCAG 
GAGAATTGATGACTCCACAGTGGCAGGCGTGGCATTTGCTCGCTATATTCTGGTGGGCTGCTGGAAGAACTTGATCG 
ATACTTTATCAACCCCACTGACTGGTCGAATGGCGGGGAGCTCCAAAGGGCTGGCCTTCATTCTGGGAGCTGAAGGC 
ATCAAAGAGCAGAACCAGAAGGAGCGGGACGCCATCTGCATGAGCCTCGACGGGCTGCGGAAAGCCGCACGGCTGAG 
CTGCGCTCTAGGCGTTGCTGCTAACTGCGCCTCAGCCCTTGCCCAGATGGCAGCTGCCTCCTGTGTCCAAGAAGAAA 
AAGAAGAGAGGGAGGCCCAAGAACCCAGTGATGCCATCACACAAGTGAAACTAAAAGTGGAGCAGAAACTGGAGCAG 
ATTGGGAAGGTGCAGGGGGTGTGGCTGCACACTGCCCACGTCTTGTGCATGGAGGCCATCCTCAGCGTAGGCCTGGA 
GATGGGAAGCCACAACCCGGACTGCTGGCCACACGTGTTCAGGGTGTGTGAATACGTGGGCACCCTGGAGCACAACC 
ACTTCAGCGATGGTGCCTCGCAGCCCCCTCTGACCATCAGCCAGCCCCAGAAGGCCACTGGAAGCGCTGGCCTCCTT 
GGGGACCCCGAGTGTGAGGGCTCGCCCCCCGAGCACAGCCCGGAGCAGGGGCGCTCCCTGAGCACGGCCCCTGTCGT 
CCAGCCCCTGTCCATCCAGGACCTCGTCCGGGAAGGCAGCCGGGGTCGGGCCTCCGACTTCCGCGGCGGGAGCCTCA 
TGAGCGGGAGCAGCGCGGCCAAGGTGGTGCTCACCCTCTCCACGCAAGCCGACAGGCTCTTTGAAGATGCTACGGAT 
AAGTTGAACCTCATGGCCTTGGGAGGTTTTCTTTACCAGCTGAAGAAAGCATCGCAGTCTCAGCTTTTCCATTCTGT 
TACAGATACAGTTGATTACTCTCTGGCAATGCCAGGAGAAGTTAAATCCACTCAAGACCGAAAAAGCGCCCTCCACC 
TGTTCCGCCTGGGGAATGCCATGCTGAGGATTGTGCGGAGCAAAGCACGGCCCCTGCTCCACGTGATGCGCTGCTGG 
AGCCTTGTGGCCCCACACCTGGTGGAGGCTGCTTGCCATAAGGAAAGACATGTGTCTCAGAAGGCTGTTTCCTTCAT 
CCATGACATACTGACAGAAGTCCTCACTGACTGGAATGAGCCACCTCATTTTCACTTCAATGAAGCACTCTTCCGAC 
CTTTCGAGCGCATTATGCAGCTGGAATTGTGTGATGAGGACGTCCAAGACCAGGTTGTCACATCCATTGGTGAGCTG 
GTTGAAGTGTGTTCCACGCAGATCCAGTCGGGATGGAGACCCTTGTTCAGTGCCCTGGAAACAGTGCATGGCGGGAA 
CAAGTCAGAGATGAAGGAGTACCTGGTTGGTGACTACTCCATGGGAAAAGGCCAAGCTCCAGTGTTTGATGTATTTG 
AAGCTTTTCTCAATACTGACAACATCCAGGTCTTTGCTAATGCAGCCACTAGCTACATCATGTGCCTTATGAAGTTT 
GTCAAAGGACTGGGGGAGGTGGACTGTAAAGAGATTGGAGACTGTGCCCCAGCACCCGGAGCCCCGTCCACAGACCT 
GTGCCTCCCGGCCCTGGATTACCTCAGGCGCTGCTCTCAGTTATTGGCCAAAATCTACAAAATGCCCTTGAAGCCAA 
TATTCCTTAGTGGGAGACTTGCCGGCTTGCCTCGAAGACTTCAGGAACAGTCAGCCAGCAGTGAGGATGGAATTGAA 
TCAGTCCTGTCTGATTTTGATGATGACACCGGTCTGATAGAAGTCTGGATAATCCTGCTGGAGCAGCTGACAGCGGC 
TGTGTCCAATTGTCCACGGCAGCACCAACCACCAACTCTGGATTTACTCTTTGAGCTGTTGAGAGATGTGACGAAAA 
CACCAGGACCAGGGTTTGGTATCTATGCAGTGGTTCACCTCCTCCTTCCTGTGATGTCCGTTTGGCTCC 
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GCCGGAGCCATAAAGACCATTCCTACTGGGATATGGCCTCTGCCAATTTCAAGCACGCTATTGGTCTGTCCTGTGAG 
CTGGTGGTGGAGCACATTCAAAGCTTTCTACATTCAGATATCAGGTACGAGAGCATGATCAATACCATGCTGAAGGA 
CCTCTTTGAGTTGCTGGTCGCCTGTGTGGCCAAGCCCACTGAAACCATCTCCAGAGTGGGCTGCTCCTGTATTAGAT 
ACGTCCTTGTGACAGCGGGCCCTGTGTTCACTGAGGAGATGTGGAGGCTTGCCTGCTGTGCCCTGCAAGATGCGTTC 
TCTGCCACACTCAAGCCAGTGAAGGACCTGCTGGGCTGCTTCCACAGCGGCACGGAGAGCTTCAGCGGGGAAGGCTG 
CCAGGTGCGAGTGGCGGCCCCGTCCTCCTCCCCAAGTGCCGAGGCCGAGTACTGGCGCATCCGAGCCATGGCCCAGC 
AGGTGTTTATGCTGGACACCCAGTGCTCACCAAAGACACCAAACAACTTTGACCACGCTCAGTCCTGCCAGCTCATT 
ATTGAGCTGCCTCCTGATGAAAAACCAAATGGACACACCAAGAAAAGCGTGTCTTTCAGGGAAATTGTGGTGAGCCT 
GCTGTCTCATCAGGTGTTACTCCAGAACTTATATGACATCTTGTTAGAAGAGTTTGTCAAAGGCCCCTCTCCTGGAG 
AGGAAAAGACGATACAAGTGCCAGAAGCCAAGCTGGCTGGCTTCCTCAGATACATCTCTATGCAGAACTTGGCAGTC 
ATATTCGACCTGCTGCTGGACTCTTATAGGACTGCCAGGGAGTTTGACACCAGCCCCGGGCTGAAGTGCCTGCTGAA 
GAAAGTGTCTGGCATCGGGGGCGCCGCCAACCTCTACCGCCAGTCTGCGATGAGCTTTAACATTTATTTCCACGCCC 
TGGTGTGTGCTGTTCTCACCAATCAAGAAACCATCACGGCCGAGCAAGTGAAGAAGGTCCTTTTTGAGGACGACGAG 
AGAAGCACGGATTCTTCCCAGCAGTGTTCATCTGAGGATGAAGACATCTTTGAGGAAACCGCCCAGGTCAGCCCCCC 
GAGAGGCAAGGAGAAGAGACAGTGGCGGGCACGGATGCCCTTGCTCAGCGTCCAGCCTGTCAGCAACGCAGATTGGG 
TGTGGCTGGTCAAGAGGCTGCACAAGCTGTGCATGGAACTGTGCAACAACTACATCCAGATGCACTTGGACCTGGAG 
AACTGTATGGAGGAGCCTCCCATCTTCAAGGGCGACCCGTTCTTCATCCTGCCCTCCTTCCAGTCCGAGTCATCCAC 
CCCATCCACCGGGGGCTTCTCTGGGAAAGAAACCCCTTCCGAGGATGACAGAAGCCAGTCCCGGGAGCACATGGGCG 
AGTCCCTGAGCCTGAAGGCCGGTGGTGGGGACCTGCTGCTGCCCCCCAGCCCCAAAGTGGAGAAGAAGGATCCCAGC 
CGGAAGAAGGAGTGGTGGGAGAATGCGGGGAACAAAATCTACACCATGGCAGCCGACAAGACCATTTCAAAGTTGAT 
GACCGAATACAAAAAGAGGAAACAGCAGCACAACCTGTCCGCGTTCCCCAAAGAGGTCAAAGTGGAGAAGAAAGGAG 
AGCCACTGGGTCCCAGGGGCCAGGACTCCCCGCTGCTTCAGCGTCCCCAGCACTTGATGGACCAAGGGCAAATGCGG 
CATTCCTTCAGCGCAGGCCCCGAGCTGCTGCGACAGGACAAGAGGCCCCGCTCAGGCTCCACCGGGAGCTCCCTCAG 
TGTCTCGGTGAGAGACGCAGAAGCACAGATCCAGGCATGGACCAACATGGTGCTAACAGTTCTCAATCAGATTCAGA 
TTCTCCCAGACCAGACCTTCACGGCCCTCCAGCCCGCAGTGTTCCCGTGCATCAGTCAGCTGACCTGTCACGTGACC 
GACATCAGAGTTCGCCAGGCTGTGAGGGAGTGGCTGGGCAGGGTGGGCCGTGTCTATGACATCATTGTG TAG CCGAC 
TCCTGTTCTACTCTCCCACCAAATAACAGTAGTGAGGGTTAGAGTCCTGCCAATACAGCTGTTGCATTTTCCCCACC 
ACTAGCCCCACTTAAACTACTACTACTGTCTCAGAGAACAGTGTTTCCTAATGTAAAAAGCCTTTCCAACCACTGAT 
CAGCATTGGGGCCATACTAAGGTTTGTATCTAGATGACACAAACGATATTCTGATTTTGCACATTATTATAGAAGAA 
TCTATAATCCTTGATATGTTTCTAACTCTTGAAGTATATTTCCCAGTGCTTTTGCTTACAGTGTTGTCCCCAAATGG 
GTCATTTTCAAGGATTACTCATTTGAAAACACTATATTGATCCATTTGATCCATCATTTAAAAAATAAATACAATTC 
CTAAGGCAATATCTGCTGGTAAGTCAAGCTGATAAACACTCAGACATCTAGTACCAGGGATTATTAATTGGAGGAAG 
ATTTATGGTTATGGGTCTGGCTGGGAAGAAGACAACTATAAATACATATTCTTGGGTGTCATAATCAAGAAAGAGGT 
GACTTCTGTTGTAAAATAATCCAGAACACTTCAAAATTATTCCTAAATCATTAAGATTTTCAGGTATTCACCAATTT 
CCCCATGTAAGGTACTGTGTTGTACCTTTATTTCTGTATTTCTAAAAGAAGAAAGTTCTTTCCTAGCAGGGTTTGAA 
GTCTGTGGCTTATCAGCCTGTGACACAGAGTACCCAGTGAAAGTGGCTGGTACGTAGATTGTCAAGAGACATAAGAC 
CGACCAGCCACCCTGGCTGTTCTTGTGGTGTTTGTTTCCATCCCCAAGGCAAACAAGGAAAGGAAAGGAAAGAAGAA 
AAGGTGCCTTAGTCCTTTGTTGCACTTCCATTTCCATGCCCCACAATTGTCTGAACATAAGGTATAGCATTTGGTTT 
TTAAGAAAACAAAACATTAAGACGCAACTCATTTTATATCAACACGCTTGGAGGAAAGGGACTCAGGGAAGGGAGCA 
GGGAGTGTGGGGTGGGGATGGATTATGATGAAATCATTTTCAATCTTAAAATATAATACAACAATCTTGCAAAATTA 
TGGTGTCAGTTACACAAGCTCTAGTCTCAAAATGAAAGTAATGGAGAAAGACACTGAAATTTAGAAAATTTTGTCGA 
TTTAAAATATTTCTCCTATCTACCAAGTAAAGTTACCCTATGTTTGATGTCTTTGCATTCAGACCAATATTTCAGGT 
GGATATTTCTAAGTATTACTAGAAAATACGTTTGAAAGCTTTATCTTATTATTTACAGTATTTTTATATTTCTTACA 
TTATCCTAATGATTGAAAACTCCTCAATCAAGCTTACTTACACACATTCTACAGAGTTATTTAAGGCATACATTATA 
ATCTCCCAGCCCCATTCATAATGAATAAGTCACCCTTTAAATATAAGACACAAATTCTACAGTATTGAAATAAGGAT 
TTAAAGGGGTATTTGTAAACTTTGCCCTCCTTGAGAAATATGGAACTACCTTAGAGGTTAAGAGGAAGGCAGTGTTC 
TGACTTCTTTAGGTGATCTGAAAAAAACACCCTTATCATCCAGTGTACCATCTAGAGATCACCACAGAATCCATTTT 
TTTCCCAGTTCCACAAAACACTCTGTTTGCCTTCAGTTTTTACTCACTAGACAATAATTCAAGTTTAGAAACAGGTA 
ATCAGCTATTTGATCTTAAAAGGCAATGAATTGTTGGGATATCAGTGAACTATGTTGTATACTTTTGAATTTTTACA 
TTTTATAAATGGAATTGAAAGTTGGATAACTGCTTTTTTTAAATTTTCCAACAGAAGTAACACCACAGTTGCTTT 
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GTTTCTTTTTATAGCTTACCTGAGGTTCAGTTCTTCTTTGTGAACCTGTGAGTACTCCACAGTTTACTGGGGGAAAA 
GGCTTCAGTAAAGCAGAGGCTAGAATTACAGTATTTATACATAGCAACTTTTCATAAAGTAGAAAAATTCAAAGGAA 
GCTGTCTCAATTTGAGAATACCAGCTGGGCACGGTCG 
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CAGAGAGGCTGTATTTCAGTGCAGCCTGCCAGACCTCTTCTGGAGGAAGACTGGACAAAGGGGGTCACACATTCCTT 
CCATACGGTTGAGCCTCTACCTGCCTGGTGCTGGTCACAGTTCAGCTTCTTCATGATGGTGGATCCCAATGGCAATG 
AATCCAGTGCTACATACTTCATCCTAATAGGCCTCCCTGGTTTAGAAGAGGCTCAGTTCTGGTTGGCCTTCCCATTG 
TGCTCCCTCTACCTTATTGCTGTGCTAGGTAACTTGACAATCATCTACATTGTGCGGACTGAGCACAGCCTGCATGA 
GCCCATGTATATATTTCTTTGCATGCTTTCAGGCATTGACATCCTCATCTCCACCTCATCCATGCCCAAAATGCTGG 
CCATCTTCTGGTTCAATTCCACTACCATCCAGTTTGATGCTTGTCTGCTACAGATTTTTGCCATCCACTCCTTATCT 
GGCATGGAATCCACAGTGCTGCTGGCCATGGCTTTTGACCGCTATGTGGCCATCTGTCACCCACTGCGCCATGCCAC 
AGTACTTACGTTGCCTCGTGTCACCAAAATTGGTGTGGCTGCTGTGGTGCGGGGGGCTGCACTGATGGCACCCCTTC 
CTGTCTTCATCAAGCAGCTGCCCTTCTGCCGCTCCAATATCCTTTCCCATTCCTACTGCCTACACCAAGATGTCATG 
AAGCTGGCCTGTGATGATATCCGGGTCAATGTCGTCTATGGCCTTATCGTCATCATCTCCGCCATTGGCCTGGACTC 
ACTTCTCATCTCCTTCTCATATCTGCTTATTCTTAAGACTGTGTTGGGCTTGACACGTGAAGCCCAGGCCAAGGCAT 
TTGGCACTTGCGTCTCTCATGTGTGTGCTGTGTTCATATTCTATGTACCTTTCATTGGATTGTCCATGGTGCATCGC 
TTTAGCAAGCGGCGTGACTCTCCGCTGCCCGTCATCTTGGCCAATATCTATCTGCTGGTTCCTCCTGTGCTCAACCC 
AATTGTCTATGGAGTGAAGACAAAGGAGATTCGACAGCGCATCCTTCGACTTTTCCATGTGGCCACACACGCTTCAG 
AGCCCTAGGTGTCAGTGATCAAACTTCTTTTCCATTCAGAGTCCTCTGATTCAGATTTTAATGTTAACATTTTGGAA 
GACAGTATTCAGAAAAAAAATTTCCTTAATAAAAAATACAACTCAGATCCTTCAAATATGAAACTGGTTGGGGAATC 
TCCATTTTTTCAATATTATTTTCTTCTTTGTTTTCTTGCTACATATAATTATTAATACCCTGACTAGGTTGTGGTTG 
GAGGGTTATTACTTTTCATTTTACCATGCAGTCCAAATCTAAACTGCTTCTACTGATGGTTTACAGCATTCTGAGAT 
AAGAATGGTACATCTAGAGAACATTTGCCAAAGGCCTAAGCACGGCAAAGGAAAATAAACACAGAATATAATAAAAT 
GAGATAATCTAGCTTAAAACTATAACTTCCTCTTCAGAACTCCCAACCACATTGGATCTCAGAAAAATGCTGTCTTC 
AAAATGACTTCTACAGAGAAGAAATAATTTTTCCTCTGGACACTAGCACTTAAGGGGAAGATTGGAAGTAAAGCCTT 
GAAAAGAGTACATTTACCTACGTTAATGAAAGTTGACACACTGTTCTGAGAGTTTTCACAGCATATGGACCCTGTTT 
TTCCTATTTAATTTTCTTATCAACCCTTTAATTAGGCAAAGATATTATTAGTACCCTCATTGTAGCCATGGGAAAAT 
TGATGTTCAGTGGGGATCAGTGAATTAAATGGGGTCATACAAGTATAAAAATTAAAAAAAAAAAAGACTTCATGCCC 
AATCTCATATGATGTGGAAGAACTGTTAGAGAGACCAACAGGGTAGTGGGTTAGAGATTTCCAGAGTCTTACATTTT 
CTAGAGGAGGTATTTAATTTCTTCTCACTCATCCAGTGTTGTATTTAGGAATTTCCTGGCAACAGAACTCATGGCTT 
TAATCCCACTAGCTATTGCTTATTGTCCTGGTCCAATTGCCAATTACCTGTGTCTTGGAAGAAGTGATTTCTAGGTT 
CACCATTATGGAAGATTCTTATTCAGAAAGTCTGCATAGGGCTTATAGCAAGTTATTTATTTTTAAAAGTTCCATAG 
GTGATTCTGATAGGCAGTGAGGTTAGGGAGCCACCAGTTATGATGGGAAGTATGGAATGGCAGGTCTTGAAGATAAC 
ATTGGCCTTTTGAGTGTGACTCGTAGCTGGAAAGTGAGGGAATCTTCAGGACCATGCTTTATTTGGGGCTTTGTGCA 
GTATGGAACAGGGACTTTGAGACCAGGAAAGCAATCTGACTTAGGCATGGGAATCAGGCATTTTTGCTTCTGAGGGG 
CTATTACCAAGGGTTAATAGGTTTCATCTTCAACAGGATATGACAACAGTGTTAACCAAGAAACTCAAATTACAAAT 
ACTAAAACATGTGATCATATATGTGGTAAGTTTCATTTTCTTTTTCAATCCTCAGGTTCCCTGATATGGATTCCTAT 
AACATGCTTTCATCCCCTTTTGTAATGGATATCATATTTGGAAATGCCTATTTAATACTTGTATTTGCTGCTGGACT 
GTAAGCCCATGAGGGCACTGTTTATTATTGAATGTCATCTCTGTTCATCATTGACTGCTCTTTGCTCATCATTGAAT 
CCCCCAGCAAAGTGCCTAGAACATAATAGTGCTTATGCTTGACACCGGTTATTTTTCATCAAACCTGATTCCTTCTG 
TCCTGAACACATAGCCAGGCAATTTTCCAGCCTTCTTTGAGTTGGGTATTATTAAATTCTGGCCATTACTTCCAATG 
TGAGTGGAAGTGACATGTGCAATTTCTATACCTGGCTCATAAAACCCTCCCATGTGCAGCCTTTCATGTTGACATTA 
AATGTGACTTGGGAAGCTATGTGTTACACAGAGTAAATCACCAGAAGCCTGGATTTCTGAAAAAACTGTGCAGAGCC 
AAACCTCTGTCATTTGCAACTCCCACTTGTATTTGTACGAGGCAGTTGGATAAGTGAAAAATAAAGTACTATTGTGT 
CAAGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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AGCACCTTGGCACAGCTGAAGCAGAGGAACACGCTGAAGGATGGTATCATCATGATCCAGACGCTGCTGATCATCCT 
CTTCATCATCGTGCCTATCTTCCTGCTGCTGGACAAGGATGACAGCAAGGCTGGCATGGAGGAAGATCACACCTACG 
AGGGCCTGGACATTGACCAGACAGCCACCTATGAGGACATAGTGACGCTGCGGACAGGGGAAGTGAAGTGGTCTGTA 
GGTGAGCACCCAGGCCAGGAGTGAGAGCCAGGTCGCCCCATGACCTGGGTGCAGGCTCCCTGGCCTCAGTGACTGCT 
TCGGAGCTGCCTGGCTCATGGCCCAACCCCTTTCCCGGACCCCCCAGCTGGCCTCTGAAGCTGGCCCACCAGAGCTG 
CCATTTGTCTCACCCCTGGTGTCCAGCTCTTGCCAAAGGGCCTGGAGTNNAAGGACAACAGGCANCACTTGGAGGGA 
GTTCTCTGGGGATGGACGGGACCCACCTT 
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CAGGAAGGTTCCTCTCCCAGTGGCCATGGGTAGCAACAGTGGGCAGGCTGGCCGCCACATCTATAAATCCCTAGCTG 
ATGATGGCCCCTTTGACTCTGTGGAGCCGCCTAAAAGACCCACCAGCAGACTCATCATGCACAGCATGGCCATGTTC 
GGAAGAGAGTTCTGCTACGCGGTGGAGGCAGCGTATGTGACCCCAGTCCTGCTCAGCGTAGGTCTGCCCAGCAGCCT 
GTACAGCATTGTGTGGTTCCTCAGCCCCATCCTGGGATTCCTGCTGCAGCCCGTGGTCGGATCGGCCAGCGACCACT 
GCCGGTCCAGGTGGGGCCGCCGGAGACCCTACATCCTCACCCTGGGAGTCATGATGCTCGTGGGCATGGCTCTGTAC 
CTCAATGGGGCTACTGTTGTAGCAGCTTTGATTGCTAACCCAAGGAGGAAGCTGGTTTGGGCCATAAGTGTCACCAT 
GATAGGTGTCGTTCTCTTTGATTTTGCTGCCGACTTCATTGATGGGCCCATCAAAGCCTACTTATTTGATGTCTGCT 
CCCATCAGGACAAGGAGAAGGGCCTCCACTACCATGCCCTCTTCACAGGTTTTGGAGGTGCCCTGGGTTACCTTTTG 
GGTGCTATAGACTGGGCCCATCTGGAGCTGGGAAGACTGTTGGGTACAGAATTCCAGGTCATGTTCTTCTTCTCTGC 
ATTGGTGCTCACTTTGTGTTTTACTGTTCATCTGTGCAGTATCTCTGAAGCCCCACTTACAGAGGTTGCAAAGGGCA 
TTCCCCCACAGCAAACCCCTCAGGACCCTCCATTGTCATCAGATGGAATGTACGAGTATGGTTCTATCGAGAAAGTT 
AAAAATGGTTACGTAAATCCAGAGCTGGCAATGCAGGGAGCAAAAAACAAAAATCATGCTGAACAGACTCGCAGGGC 
AATGACATTAAAGTCACTGCTGAGAGCACTGGTGAACATGCCTCCTCACTACCGCTACCTTTGCATCAGCCACCTCA 
TTGGATGGACGGCCTTCCTGTCCAACATGCTGTTCTTCACAGATTTCATGGGCCAGATTGTGTACCGCGGGGATCCC 
TATAGTGCACACAACTCCACAGAGTTTCTCATCTACGAAAGAGGAGTCGAGGTTGGATGTTGGGGCTTCTGCATCAA 
CTCCGTGTTTTCCTCACTTTATTCTTACTTTCAGAAAGTTTTGGTATCCTACATTGGATTAAAGGGTCTTTACTTCA 
CGGGATATTTGCTGTTTGGCCTGGGGACGGGATTTATTGGGCTCTTCCCGAATGTCTACTCCACCCTGGTCCTGTGC 
AGCCTGTTTGGTGTAATGTCCAGCACCCTGTACACTGTGCCCTTTAACCTCATTACTGAGTACCACCGCGAGGAAGA 
AAAGGAGAGGCAGCAGGCCCCAGGAGGGGACCCAGACAACAGCGTGAGAGGGAAGGGCATGGACTGCGCCACCCTCA 
CATGCATGGTGCAGCTGGCTCAGATCCTGGTCGGAGGTGGCCTGGGCTTTCTGGTCAACACAGCCGGGACCGTTGTC 
GTCGTGGTGATCACAGCGTCTGCGGTGGCACTGATAGGCTGTTGCTTTGTCGCTCTCTTTGTTAGATATGTGGATTA 
GGTCAATAAAGAGACAATGACCCTAAAAAAAAA 
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GAATCATCCACGCACCTGCAGCTCTGCTGAGAGAGTGCAAGCCGTGGGGGTTTTGAGCTCATCTTCATCATTCATAT 
GAGGAAATAAGTGGTAAAATCCTTGGAAATACAATGAGACTCATCAGAAACATTTACATATTTTGTAGTATTGTTAT 
GACAGCAGAGGGTGATGCTCCAGAGCTGCCAGAAGAAAGGGAACTGATGACCAACTGCTCCAACATGTCTCTAAGAA 
AGGTTCCCGCAGACTTGACCCCAGCCACAACGACACTGGATTTATCCTATAACCTCCTTTTTCAACTCCAGAGTTCA 
GATTTTCATTCTGTCTCCAAACTGAGAGTTTTGATTCTATGCCATAACAGAATTCAACAGCTGGATCTCAAAACCTT 
TGAATTCAACAAGGAGTTAAGATATTTAGATTTGTCTAATAACAGACTGAAGAGTGTAACTTGGTATTTACTGGCAG 
GTCTCAGGTATTTAGATCTTTCTTTTAATGACTTTGACACCATGCCTATCTGTGAGGAAGCTGGCAACATGTCACAC 
CTGGAAATCCTAGGTTTGAGTGGGGCAAAAATACAAAAATCAGATTTCCAGAAAATTGCTCATCTGCATCTAAATAC 
TGTCTTCTTAGGATTCAGAACTCTTCCTCATTATGAAGAAGGTAGCCTGCCCATCTTAAACACAACAAAACTGCACA 
TTGTTTTACCAATGGACACAAATTTCTGGGTTCTTTTGCGTGATGGAATCAAGACTTCAAAAATATTAGAAATGACA 
AATATAGATGGCAAAAGCCAATTTGTAAGTTATGAAATGCAACGAAATCTTAGTTTAGAAAATGCTAAGACATCGGT 
TCTATTGCTTAATAAAGTTGATTTACTCTGGGACGACCTTTTCCTTATCTTACAATTTGTTTGGCATACATCAGTGG 
AACACTTTCAGATCCGAAATGTGACTTTTGGTGGTAAGGCTTATCTTGACCACAATTCATTTGACTACTCAAATACT 
GTAATGAGAACTATAAAATTGGAGCATGTACATTTCAGAGTGTTTTACATTCAACAGGATAAAATCTATTTGCTTTT 
GACCAAAATGGACATAGAAAACCTGACAATATCAAATGCACAAATGCCACACATGCTTTTCCCGAATTATCCTACGA 
AATTCCAATATTTAAATTTTGCCAATAATATCTTAACAGACGAGTTGTTTAAAAGAACTATCCAACTGCCTCACTTG 
AAAACTCTCATTTTGAATGGCAATAAACTGGAGACACTTTCTTTAGTAAGTTGCTTTGCTAACAACACACCCTTGGA 
ACACTTGGATCTGAGTCAAAATCTATTACAACATAAAAATGATGAAAATTGCTCATGGCCAGAAACTGTGGTCAATA 
TGAATCTGTCATACAATAAATTGTCTGATTCTGTCTTCAGGTGCTTGCCCAAAAGTATTCAAATACTTGACCTAAAT 
AATAACCAAATCCAAACTGTACCTAAAGAGACTATTCATCTGATGGCCTTACGAGAACTAAATATTGCATTTAATTT 
TCTAACTGATCTCCCTGGATGCAGTCATTTCAGTAGACTTTCAGTTCTGAACATTGAAATGAACTTCATTCTCAGCC 
CATCTCTGGATTTTGTTCAGAGCTGCCAGGAAGTTAAAACTCTAAATGCGGGAAGAAATCCATTCCGGTGTACCTGT 
GAATTAAAAAATTTCATTCAGCTTGAAACATATTCAGAGGTCATGATGGTTGGATGGTCAGATTCATACACCTGTGA 
ATACCCTTTAAACCTAAGGGGAACTAGGTTAAAAGACGTTCATCTCCACGAATTATCTTGCAACACAGCTCTGTTGA 
TTGTCACCATTGTGGTTATTATGCTAGTTCTGGGGTTGGCTGTGGCCTTCTGCTGTCTCCACTTTGATCTGCCCTGG 
TATCTCAGGATGCTAGGTCAATGCACACAAACATGGCACAGGGTTAGGAAAACAACCCAAGAACAACTCAAGAGAAA 
TGTCCGATTCCACGCATTTATTTCATACAGTGAACATGATTCTCTGTGGGTGAAGAATGAATTGATCCCCAATCTAG 
AGAAGGAAGATGGTTCTATCTTGATTTGCCTTTATGAAAGCTACTTTGACCCTGGCAAAAGCATTAGTGAAAATATT 
GTAAGCTTCATTGAGAAAAGCTATAAGTCCATCTTTGTTTTGTCTCCCAACTTTGTCCAGAATGAGTGGTGCCATTA 
TGAATTCTACTTTGCCCACCACAATCTCTTCCATGAAAATTCTGATCATATAATTCTTATCTTACTGGAACCCATTC 
CATTCTATTGCATTCCCACCAGGTATCATAAACTGAAAGCTCTCCTGGAAAAAAAAGCATACTTGGAATGGCCCAAG 
GATAGGCGTAAATGTGGGCTTTTCTGGGCAAACCTTCGAGCTGCTATTAATGTTAATGTATTAGCCACCAGAGAAAT 
GTATGAACTGCAGACATTCACAGAGTTAAATGAAGAGTCTCGAGGTTCTACAATCTCTCTGATGAGAACAGATTGTC 
TATAAAATCCCACAGTCCTTGGGAAGTTGGGGACCACATACACTGTTGGGATGTACATTGATACAACCTTTATGATG 
GCAATTTGACAATATTTATTAAAATAAAAAATGGTTATTCCCTTCATATCAGTTTCTAGAAGGATTTCTAAGAATGT 
ATCCTATAGAAACACCTTCACAAGTTTATAAGGGCTTATGGAAAAAGGTGTTCATCCCAGGATTGTTTATAATCATG 
AAAAATGTGGCCAGGTGCAGTGGCTCACTCTTGTAATCCCAGCACTATGGGAGGCCAAGGTGGGTGACCCACGAGGT 
CAAGAGATGGAGACCATCCTGGCCAACATGGTGAAACCCTGTCTCTACTAAAAATACAAAAATTAGCTGGGCGTGAT 
GGTGCACGCCTGTAGTCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGTGGCAGTTGCAG 
TGAGCTGAGATCGAGCCACTGCACTCCAGCCTGGTGACAGAGCGAGACTCCATCTCAAAAAAAAGAAAAAAAAAAAA 
GAAAAAAATGGAAAACATCCTCATGGCCACAAAATAAGGTCTAATTCAATAAATTATAGTACATTAATGTAATATAA 
TATTACATGCCACTAAAAAGAATAAGGTAGCTGTATATTTCCTGGTATGGAAAAAACATATTAATATGTTATAAACT 
ATTAGGTTGGTGCAAAACTAATTGTGGTTTTTGCCATTGAAATGGCATTGAAATAAAAGTGTAAAGAAATCTATACC 
AGATGTAGTAACAGTGGTTTGGGTCTGGGAGGTTGGATTACAGGGAGCATTTGATTTCTATGTTGTGTATTTCTATA 
ATGTTTGAATTGTTTAGAATGAATCTGTATTTCTTTTATAAGTAGAAAAAAAATAAAGATAGTTTTTACAGCCT 
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GTCACCTGGATGGTGTAAAGGAAACAAGAGACAGGAACAGAGCCCCTCATCTCACCTCTGGGCTACCATACAGAAAA 
GCAGAATTGGCAGGAACTGAA AATG ACTAGGAAGAGGACATACTGGGTGCCCAACTCTTCTGGTGGCCTCGTGAATC 
GTGGCATCGACATAGGCGATGACATGGTTTCAGGACTTATTTATAAAACCTATACTCTCCAAGATGGCCCCTGGAGT 
CAGCAAGAGAGAAATCCTGAGGCTCCAGGGAGGGCAGCTGTCCCACCGTGGGGGAAGTATGATGCTGCCTTGAGAAC 
CATGATTCCCTTCCGTCCCAAGCCGAGGTTTCCTGCCCCCCAGCCCCTGGACAATGCTGGCCTGTTCTCCTACCTCA 
CCGTGTCATGGCTCACCCCGCTCATGATCCAAAGCTTACGGAGTCGCTTAGATGAGAACACCATCCCTCCACTGTCA 
GTCCATGATGCCTCAGACAAAAATGTCCAAAGGCTTCACCGCCTTTGGGAAGAAGAAGTCTCAAGGCGAGGGATTGA 
AAAAGCTTCAGTGCTTCTGGTGATGCTGAGGTTCCAGAGAACAAGGTTGATTTTCGATGCACTTCTGGGCATCTGCT 
TCTGCATTGCCAGTGTACTCGGGCCAATATTGATTATACCAAAGATCCTGGAATATTCAGAAGAGCAGTTGGGGAAT 
GTTGTCCATGGAGTGGGACTCTGCTTTGCCCTTTTTCTCTCCGAATGTGTGAAGTCTCTGAGTTTCTCCTCCAGTTG 
GATCATCAACCAACGCACAGCCATCAGGTTCCGAGCAGCTGTTTCCTCCTTTGCCTTTGAGAAGCTCATCCAATTTA 
AGTCTGTAATACACATCACCTCAGGAGAGGCCATCAGCTTCTTCACCGGTGATGTAAACTACCTGTTTGAAGGGGTG 
TGCTATGGACCCCTAGTACTGATCACCTGCGCATCGCTGGTCATCTGCAGCATTTCTTCCTACTTCATTATTGGATA 
CACTGCATTTATTGCCATCTTATGCTATCCCCTGGTTTTCCCACTGGAGGTATTCATGACAAGAATGGCTGTGAAGG 
CTCAGCATCACACATCTGAGGTCAGCGACCAGCGCATCCGTGTGACCAGTGAAGTTCTCACTTGCATTAAGCTGATT 
AAAATGTACACATGGGAGAAACCATTTGCAAAAATCATTGAAGACCTAAGAAGGAAGGAAAGGAAGCTATTGGAGAA 
GTGCGGGCTTGTCCAGAGCCTGACAAGTATAACCTTGTTCATCATCCCCGCAGTGGCCACAGCGGTCTGGGTTCTCA 
TCCACACATCCTTAAAGCTGAAACTCACAGCGTCAATGGCCTTCAGCATGCTAGCCTCCTTGAATCTCCTTCGGCTG 
TCAGTGTTCTTTGTGCCTATTGCAGTCAAAGGTCTCACGAATTCCAAGTCTGCAGTGATGAGGTTCAAGAAGTTTTT 
CCTCCAGGAGAGCCCTGTTTTCTATGTCCAGACATTACAAGACCCCAGCAAAGCTCTGGTCTTTGAGGAGGCCACCT 
TGTCATGGCAACAGACCTGTCCCGGGATCGTCAATGGGGCACTGGAGCTGGAGAGGAACGGGCATGCTTCTGAGGGG 
ATGACCAGGCCTAGAGATGCCCTCGGGCCAGAGGAAGAAGGGAACAGCCTGGGCCCAGAGTTGCACAAGATCAACCT 
GGTGGTGTCCAAGGGGATGATGTTAGGGGTCTGCGGCAACACGGGGAGTGGTAAGAGCAGCCTGTTGTCAGCCATCC 
TGGAGGAGATGCACTTGCTCGAGGGCTCGGTGGGGGTGCAGGGAAGCCTGGCCTATGTCCCCCAGCAGGCCTGGATC 
GTCAGCGGGAACATCAGGGAGAACATCCTCATGGGAGGCGCATATGACAAGGCCCGATACCTCCAGGTGCTCCACTG 
CTGCTCCCTGAATCGGGACCTGGAACTTCTGCCCTTTGGAGACATGACAGAGATTGGAGAGCGGGGCCCCAACCTCT 
CTGGGGGGCAGAAACAGAGGATCAGCCTGGCCCGCGCCGTCTATTCCGACCGTCAGATCTACCTGCTGGACGACCCC 
CTGTCTGCTGTGGACGCCCACGTGGGGAAGCACATTTTTGAGGAGTGCATTAAGAAGACACTCAGGGGGAAGACGGT 
CGTCCAGGTGACCCACCAGCTGCAGTACTTAGAATTTTGTGGCCAGGTCATTTTGTTGGAAAATGGGAAAATCTGTG 
AAAATGGAACTCACAGTGAGTTAATGCAGAAAAAGGGGAAATATGCCCAACTTATCCAGAAGATGCACAAGGAAGCC 
ACTTCGGACATGTTGCAGGACACAGCAAAGATAGCAGAGAAGCCAAAGGTAGAAAGTCAGGCTCTGGCCACCTCCCT 
GGAAGAGTCTCTCAACGGAAATGCTGTGCCGGAGCATCAGCTCACACAGGAGGAGGAGATGGAAGAAGGCTCCTTGA 
GTTGGAGGGTCTACCACCACTACATCCAGGCAGCTGGAGGTTACATGGTCTCTTGCATAATTTTCTTCTTTGTGGTG 
CTGATCGTCTTCTTAACGATCTTCAGCTTCTGGTGGCTGAGCTACTGGTTGGAGCAGGGCTCGGGGACCAATAGCAG 
CCGAGAGAGCAATGGAACCATGGCAGACCTGGGCAACATTGCAGACAATCCTCAACTGTCCTTCTACCAGCTGGTGT 
ACGGGCTCAACGCCCTGCTCCTCATCTGTGTGGGGGTCTGCTCCTCAGGGATTTTCACCAAGGTCACGAGGAAGGCA 
TCCACGGCCCTGCACAACAAGCTCTTCAACAAGGTTTTCCGCTGCCCCATGAGTTTCTTTGACACCATCCCAATAGG 
CCGGCTTTTGAACTGCTTCGCAGGGGACTTGGAACAGCTGGACCAGCTCTTGCCCATCTTTTCAGAGCAGTTCCTGG 
TCCTGTCCTTAATGGTGATCGCCGTCCTGTTGATTGTCAGTGTGCTGTCTCCATATATCCTGTTAATGGGAGCCATA 
ATCATGGTTATTTGCTTCATTTATTATATGATGTTCAAGGAGGCCATCGGTGTGTTCAAGAGACTGGAGAACTATAG 
CCGGTCTCCTTTATTCTCCCACATCCTCAATTCTCTGCAAGGCCTGAGCTCCATCCATGTCTATGGAAAAACTGAAG 
ACTTCATCAGCCAGTTTAAGAGGCTGACTGATGCGCAGAATAACTACCTGCTGTTGTTTCTATCTTCCACACGATGG 
ATGGCATTGAGGCTGGAGATCATGACCAACCTTGTGACCTTGGCCGTTGCCCTGTTCGTGGCTTTTGGCATTTCCTC 
CACCCCCTACTCCTTTAAAGTCATGGCTGTCAACATCGTGCTGCAGCTGGCGTCCAGCTTCCAGGCCACTGCCCGGA 
TTGGCTTGGAGACAGAGGCACAGTTCACGGCTGTAGAGAGGATACTGCAGTACATGAAGATGTGTGTCTCGGAAGCT 
CCTTTACACATGGAAGGCACAAGTTGTCCCCAGGGGTGGCCACAGCATGGGGAAATCATATTTCAGGATTATCACAT 
GAAATACAGAGACAACACACCCACCGTGCTTCACGGCATCAACCTGACCATCCGCGGCCACGAAGTGGTGGGCATCG 
TGGGAAGGACGGGCTCTGGGAAGTCCTCCTTGGGCATGGCTCTCTTCCGCCTGGTGGAGCCCATGGCAGGCCGGATT 
CTCATTGACGGCGTGGACATTTGCAGCATCGGCCTGGAGGACTTGCGGTCCAAGCTCTCAGTGATCCCTCAAGATCC 
AGTGCTGCTCTCAGGAACCATCAGATTCAACCTAGATCCCTTTGACCGTCACACCGACCAGCAGATCTGGGATGCCT 
TGGAGAGGACATTCCTGACCAAGGCCATCTCAAAGTTCCCCAAAAAGCTGCATACAGATGTGGTGGAAAACGGTGGA 
AACTTCTCTGTGGGGGAGAGGCAGCTGCTCTGCATTGCCAGGGCTGTGCTTCGCAACTCCAAGATCATC 
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CTTATCGATGAAGCCACAGCCTCCATTGACATGGAGACAGACACCCTGATCCAGCGCACAATCCGTGAAGCCTTCCA 
GGGCTGCACCGTGCTCGTCATTGCCCACCGTGTCACCACTGTGCTGAACTGTGACCGCATCCTGGTTATGGGCAATG 
GGAAGGTGGTAGAATTTGATCGGCCGGAGGTACTGCGGAAGAAGCCTGGGTCATTGTTCGCAGCCCTCATGGCCACA 
GCCACTTCTTCACTGAGATAAGGAGATGTGGAGACTTCATGGAGGCTGGCAGCTGAGCTCAGAGGTTCACACAGCTG 
CAGCTTCGAGGCCCACAGTCTGCGACCTTCTTGTTTGGAGATGAGAACTTCTCCTGGAAGCAGGGGTAAATGTAGGG 
GGGGTGGGGATTGCTGGATGGAAACCCTGGAATAGGCTACTTGATGGCTCTCAAGACCTTAGAACCCCAGAACCATC 
TAAGACATGGGATTCAGTGATCATGTGGTTCTCC 
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GGTTGCCACAGCTGGTTTAGGGCCCCGACCACTGGGGCCCCTTGTCAGGAGGAGACAGCCTCCCGGCCCGGGGAGGA 
CAAGTCGCTGCCACCTTTGGCTGCCGACGTGATTCCCTGGGACGGTCCGTTTCCTGCCGTCAGCTGCCGGCCGAGTT 
GGGTCTCCGTGTTTCAGGCCGGCTCCCCCTTCCTGGTCTCCCTTCTCCCGCTGGGCCGGTTTATCGGGAGGAGATTG 
TCTTCCAGGGCTAGCAATTGGACTTTTGATGATGTTTGACCCAGCGGCAGGAATAGCAGGCAACGTGATTTCAAAGC 
TGGGCTCAGCCTCTGTTTCTTCTCTCGTGTAATCGCAAAACCCATTTTGGAGCAGGAATTCCAATC ATG TCTGTGAT 
GGTGGTGAGAAAGAAGGTGACACGGAAATGGGAGAAACTCCCAGGCAGGAACACCTTTTGCTGTGATGGCCGCGTCA 
TGATGGCCCGGCAAAAGGGCATTTTCTACCTGACCCTTTTCCTCATCCTGGGGACATGTACACTCTTCTTCGCCTTT 
GAGTGCCGCTACCTGGCTGTTCAGCTGTCTCCTGCCATCCCTGTATTTGCTGCCATGCTCTTCCTTTTCTCCATGGC 
TACACTGTTGAGGACCAGCTTCAGTGACCCTGGAGTGATTCCTCGGGCGCTACCAGATGAAGCAGCTTTCATAGAAA 
TGGAGATAGAAGCTACCAATGGTGCGGTGCCCCAGGGCCAGCGACCACCGCCTCGTATCAAGAATTTCCAGATAAAC 
AACCAGATTGTGAAACTGAAATACTGTTACACATGCAAGATCTTCCGGCCTCCCCGGGCCTCCCATTGCAGCATCTG 
TGACAACTGTGTGGAGCGCTTCGACCATCACTGCCCCTGGGTGGGGAATTGTGTTGGAAAGAGGAACTACCGCTACT 
TCTACCTCTTCATCCTTTCTCTCTCCCTCCTCACAATCTATGTCTTCGCCTTCAACATCGTCTATGTGGCCCTCAAA 
TCTTTGAAAATTGGCTTCTTGGAGACATTGAAAGAAACTCCTGGAACTG'TTCTAGAAGTCCTCATTTGCTTCTTTAC 
ACTCTGGTCCGTCGTGGGACTGACTGGATTTCATACTTTCCTCGTGGCTCTCAACCAGACAACCAATGAAGACATCA 
AAGGATCATGGACAGGGAAGAATCGCGTCCAGAATCCCTACAGCCATGGCAATATTGTGAAGAACTGCTGTGAAGTG 
CTGTGTGGCCCCTTGCCCCCCAGTGTGCTGGATCGAAGGGGTATTTTGCCACTGGAGGAAAGTGGAAGTCGACCTCC 
CAGTACTCAAGAGACCAGTAGCAGCCTCTTGCCACAGAGCCCAGCCCCCACAGAACACCTGAACTCAAATGAGATGC 
CGGAGGACAGCAGCACTCCCGAAGAGATGCCACCTCCAGAGCCCCCAGAGCCACCACAGGAGGCAGCTGAAGCTGAG 
AAGTAGCCTATCTATGGAAGAGACTTTTGTTTGTGTTTAATTAGGGCTATGAGAGATTTCAGGTGAGAAGTTAAACC 
TGAGACAGAGAGCAAGTAAGCTGTCCCTTTTAACTGTTTTTCTTTGGTCTTTAGTCACCCAGTTGCACACTGGCATT 
TTCTTGCTGCAAGCTTTTTTAAATTTCTGAACTCAAGGCAGTGGCAGAAGATGTCAGTCACCTCTGATAACTGGAAA 
AATGGGTCTCTTGGGCCCTGGCACTGGTTCTCCATGGCCTCAGCCACAGGGTCCCCTTGGACCCCCTCTCTTCCCTC 
CAGATCCCAGCCCTCCTGCTTGGGGTCACTGGTCTCATTCTGGGGCTAAAAGTTTTTGAGACTGGCTCAAATCCTCC 
CAAGCTGCTGCACGTGCTGAGTCCAGAGGCAGTCACAGAGACCTCTGGCCAGGGGATCCTAACTGGGTTCTTGGGGT 
CTTCAGGACTGAAGAGGAGGGAGAGTGGGGTCAGAAGATTCTCCTGGCCACCAAGTGCCAGCATTGCCCACAAATCC 
TTTTAGGAATGGGACAGGTACCTTCCACTTGTTGTAXXXXXXXXXXXXXXXXXXXXXXXXXXXTTGTTTTTCCTTTT 
GACTCCTGCTCCCATTAGGAGCAGGAATGGCAGTAATAAAAGTCTGCACTTTGGTCATTTCTTTTCCTCAGAGGAAG 
CCCGAGTGCTCACTTAAACACTATCCCCTCAGACTCCCTGTGTGAGGCCTGCAGAGGCCCTGAATGCACAAATGGGA 
AACCAAGGCACAGAGAGGCTCTCCTCTCCTCTCCTCTCCCCCGATGTACCCTCAAAAAAAAAAAAATGCTAACCAGT 
TCTTCCATTAAGCCTCGGCTGAGTGAGGGAAAGCCCAGCACTGCTGCCCTCTCGGGTAACTCACCCTAAGGCCTCGG 
CCCACCTCTGGCTATGGTAACCACACTGGGGGCTTCCTCCAAGCCCCGCTCTTCCAGCACTTCCACCGGCAGAGTCC 
CAGAGCCACTTCACCCTGGGGGTGGGCTGTGGCCCCCAGTCAGCTCTGCTCAGGACCTGCTCTATTTCAGGGAAGAA 
GATTTATGTATTATATGTGGCTATATTTCCTAGAGCACCTGTGTTTTCCTCTTTCTAAGCCAGGGTCCTGTCTGGAT 
GACTTATGCGGTGGGGGAGTGTAAACCGGAACTTTTCATCTATTTGAAGGCGATTAAACTGTGTCTAATGCA 
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GAGCCGCCGCCGCGCGCGCGCCGCGCACTGCAGCCCCAGGCCCCGGCCCCCCACCCACGTCTGCGTTGCTGCCCCGC 
CTGGGCCAGGCCCCAAAGGCAAGGACAAAGCAGCTGTCAGGGAACCTCCGCCGGAGTCGAATTTACGTGCAGCTGCC 
GGCAACCACAGGTTCCAAGATGGTTTGCGGGGGCTTCGCGTGTTCCAAGAACTGCCTGTGCGCCCTCAACCTGCTTT 
ACACCTTGGTTAGTCTGCTGCTAATTGGAATTGCTGCGTGGGGCATTGGCTTCGGGCTGATTTCCAGTCTCCGAGTG 
GTCGGCGTGGTCATTGCAGTGGGCATCTTCTTGTTCCTGATTGCTTTAGTGGGTCTGATTGGAGCTGTAAAACATCA 
TCAGGTGTTGCTATTTTTTTATATGATTATTCTGTTACTTGTATTTATTGTTCAGTTTTCTGTATCTTGCGCTTGTT 
TAGCCCTGAACCAGGAGCAACAGGGTCAGCTTCTGGAGGTTGGTTGGAACAATACGGCAAGTGCTCGAAATGACATC 
CAGAGAAATCTAAACTGCTGTGGGTTCCGAAGTGTTAACCCAAATGACACCTGTCTGGCTAGCTGTGTTAAAAGTGA 
CCACTCGTGCTCGCCATGTGCTCCAATCATAGGAGAATATGCTGGAGAGGTTTTGAGATTTGTTGGTGGCATTGGCC 
TGTTCTTCAGTTTTACAGAGATCCTGGGTGTTTGGCTGACCTACAGATACAGGAACCAGAAAGACCCCCGCGCGAAT 
CCTAGTGCATTCCTTTGATGAGAAAACAAGGAAGATTTCCTTTCGTATTATGATCTTGTTCACTTTCTGTAATTTTC 
TGTTAAGCTCCATTTGCCAGTTTAAGGAAGGAAACACTATCTGGAAAAGTACCTTATTGATAGTGGAATTATATATT 
TTTACTCTATGTTTCTCTACATGTTTTTTTCTTTCCGTTGCTGAAAAATATTTGAAACTTGTGGTCTCTGAAGCTCG 
GTGGCACCTGGAATTTACTGTATTCATTGTCGGGCACTGTCCACTGTGGCCTTTCTTAGCATTTTTACCTGCAGAAA 
AACTTTGTATGGTACCACTGTGTTGGTTATATGGTGAATCTGAACGTACATCTCACTGGTATAATTATATGTAGCAC 
TGTGCTGTGTAGATAGTTCCTACTGGAAAAAGAGTGGAAATTTATTAAAATCAGAAAGTATGAGATCCTGTTATGTT 
AAGGGAAATCCAAATTCCCAATTTTTTTTGGTCTTTTTAGGAAAGATTGTTGTGGTAAAAAGTGTTAGTATAAAAAT 
GATAATTTACTTGTAGTCTTTTATGATTACACCAATGTATTCTAGAAATAGTTATGTCTTAGGAAATTGTGGTTTAA 
TTTTTGACTTTTACAGGTAAGTGCAAAGGAGAAGTGGTTTCATGAAATGTTCTAATGTATAATAACATTTACCTTCA 
GCCTCCATCAGAATGGAACGAGTTTTGAGTAATCAGGAAGTATATCTATATGATCTTGATATTGTTTTATAATAATT 
TGAAGTCTAAAAGACTGCATTTTTAAACAAGTTAGTATTAATGCGTTGGCCCACGTAGCAAAAAGATATTTGATTAT 
CTTAAAAATTGTTAAATACCGTTTTCATGAAATTTCTCAGTATTGTAACAGCAACTTGTCAAACCTAAGCATATTTG 
AATATGATCTCCCATAATTTGAAATTGAAATCGTATTGTGTGGCTCTGTATATTCTGTTAAAAAATTAAAGGACAGA 
AACCTTTCTTTGTGTATGCATGTTTGAATTAAAAGAAAGTAATGGAAG 
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CAGTCACCATGAAGCTGGGCTGTGTCCTCATGGCCTGGGCCCTCTACCTTTCCCTTGGTGTGCTCTGGGTGGCCCAG 
ATGCTACTGGCTGCCAGTTTTGAGACGCTGCAGTGTGAGGGACCTGTCTGCACTGAGGAGAGCAGCTGCCACACGGA 
GGATGACTTGACTGATGCAAGGGAAGCTGGCTTCCAGGTCAAGGCCTACACTTTCAGTGAACCCTTCCACCTGATTG 
TGTCCTATGACTGGCTGATCCTCCAAGGTCCAGCCAAGCCAGTTTTTGAAGGGGACCTGCTGGTTCTGCGCTGCCAG 
GCCTGGCAAGACTGGCCACTGACTCAGGTGACCTTCTACCGAGATGGCTCAGCTCTGGGTCCCCCCGGGCCTAACAG 
GGAATTCTCCATCACCGTGGTACAAAAGGCAGACAGCGGGCACTACCACTGCAGTGGCATCTTCCAGAGCCCTGGTC 
CTGGGATCCCAGAAACAGCATCTGTTGTGGCTATCACAGTCCAAGAACTGTTTCCAGCGCCAATTCTCAGAGCTGTA 
CCCTCAGCTGAACCCCAAGCAGGAAGCCCCATGACCCTGAGTTGTCAGACAAAGTTGCCCCTGCAGAGGTCAGCTGC 
CCGCCTCCTCTTCTCCTTCTACAAGGATGGAAGGATAGTGCAAAGCAGGGGGCTCTCCTCAGAATTCCAGATCCCCA 
CAGCTTCAGAAGATCACTCCGGGTCATACTGGTGTGAGGCAGCCACTGAGGACAACCAAGTTTGGAAACAGAGCCCC 
CAGCTAGAGATCAGAGTGCAGGGTGCTTCCAGCTCTGCTGCACCTCCCACATTGAATCCAGCTCCTCAGAAATCAGC 
TGCTCCAGGAACTGCTCCTGAGGAGGCCCCTGGGCCTCTGCCTCCGCCGCCAACCCCATCTTCTGAGGATCCAGGCT 
TTTCTTCTCCTCTGGGGATGCCAGATCCTCATCTGTATCACCAGATGGGCCTTCTTCTCAAACACATGCAGGATGTG 
AGAGTCCTCCTCGGTCACCTGCTCATGGAGTTGAGGGAATTATCTGGCCACCAGAAGCCTGGGACCACAAAGGCTAC 
TGCTGAATAGAAGTAAACAGTTCATCCATGATCTCACTTAACCACCCCAATAAATCTGATTCTTTATTTTCTCTTCC 
TGTCCTGCACATATGCATAAGTACTTTTACAAGTTGTCCCAGTGTTTTGTTAGAATAATGTAGTTAGGTGAGTGTAA 
ATAAATTTATATAAAGTGAGAATTAGAGTTTAGCTATAATTGTGTATTCTCTCTTAACACAACAGAATTCTGCTGTC 
TAGATCAGGAATTTCTATCTGTTATATCGACCAGAATGTTGTGATTTAAAGAGAACTAATGGAAGTGGATTGAATAC 
AGCAGTCTCAACTGGGGGCAATTTTGCCCCCCAGAGGACATTGGGCAATGTTTGGAGACATTTTGGTCATTATACTT 
GGGGGGTTGGGGGATGGTGGGATGTGTGTCTACTGGCATCCAGTAAATAGAAGCCAGGGGTGCCGCTAAACATCCTA 
TAATGCACAGGGCAGTACCCCACAACGAAAAATAATCTGGCCCAAAATGTCAGTTGTACTGAGTTTGAGAAACCCCA 
GCCTAATGAAACCCTAGGTGTTGGGCTCTGGAATGGGACTTTGTCCCTTCTAATTATTATCTCTTTCCAGCCTCATT 
CAGCTATTCTTACTGACATACCAGTCTTTAGCTGGTGCTATGGTCTGTTCTTTAGTTCTAGTTTGTATCCCCTCAAA 
AGCCATTATGTTGAAATCCTAATCCCCAAGGTGATGGCATTAAGAAGTGGGCCTTTGGGAAGTGATTAGATCAGGAG 
TGCAGAGCCCTCATGATTAGGATTAGTGCCCTTATTTAAAAAGGCCCCAGAGAGCTAACTCACCCTTCCACCATATG 
AGGACGTGGCAAGAAGATGACATGTATGAGAACCAAAAAACAGCTGTCGCCAAACACCGACTCTGTCGTTGCCTTGA 
TCTTGAACTTCCAGCCTCCAGAACTATGAGAAATAAAATTCTGGTTGTTTGTAGCCTAA 
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CCGGCTCGCGCCCTCCGGGCCCAGCCTCCCGAGCCTTCGGAGCGGGCGCCGTCCCAGCCCAGCTCCGGGGAAACGCG 
AGCCGCGATGCCTGGGGGGTGCTCCCGGGGCCCCGCCGCCGGGGACGGGCGTCTGCGGCTGGCGCGACTAGCGCTGG 
TACTCCTGGGCTGGGTCTCCTCGTCTTCTCCCACCTCCTCGGCATCCTCCTTCTCCTCCTCGGCGCCGTTCCTGGCT 
TCCGCCGTGTCCGCCCAGCCCCCGCTGCCGGACCAGTGCCCCGCGCTGTGCGAGTGCTCCGAGGCAGCGCGCACAGT 
CAAGTGCGTTAACCGCAATCTGACCGAGGTGCCCACGGACCTGCCCGCCTACGTGCGCAACCTCTTCCTTACCGGCA 
ACCAGCTGGCCGTGCTCCCTGCCGGCGCCTTCGCCCGCCGGCCGCCGCTGGCGGAGCTGGCCGCGCTCAACCTCAGC 
GGCAGCCGCCTGGACGAGGTGCGCGCGGGCGCCTTCGAGCATCTGCCCAGCCTGCGCCAGCTCGACCTCAGCCACAA 
CCCACTGGCCGACCTCAGTCCCTTCGCTTTCTCGGGCAGCAATGCCAGCGTCTCGGCCCCCAGTCCCCTTGTGGAAC 
TGATCCTGAACCACATCGTGCCCCCTGAAGATGAGCGGCAGAACCGGAGCTTCGAGGGCATGGTGGTGGCGGCCCTG 
CTGGCGGGCCGTGCACTGCAGGGGCTCCGCCGCTTGGAGCTGGCCAGCAACCACTTCCTTTACCTGCCGCGGGATGT 
GCTGGCCCAACTGCCCAGCCTCAGGCACCTGGACTTAAGTAATAATTCGCTGGTGAGCCTGACCTACGTGTCCTTCC 
GCAACCTGACACATCTAGAAAGCCTCCACCTGGAGGACAATGCCCTCAAGGTCCTTCACAATGGCACCCTGGCTGAG 
TTGCAAGGTCTACCCCACATTAGGGTTTTCCTGGACAACAATCCCTGGGTCTGCGACTGCCACATGGCAGACATGGT 
GACCTGGCTCAAGGAAACAGAGGTAGTGCAGGGCAAAGACCGGCTCACCTGTGCATATCCGGAAAAAATGAGGAATC 
GGGTCCTCTTGGAACTCAACAGTGCTGACCTGGACTGTGACCCGATTCTTCCCCCATCCCTGCAAACCTCTTATGTC 
TTCCTGGGTATTGTTTTAGCCCTGATAGGCGCTATTTTCCTCCTGGTTTTGTATTTGAACCGCAAGGGGATAAAAAA 
GTGGATGCATAACATCAGAGATGCCTGCAGGGATCACATGGAAGGGTATCATTACAGATATGAAATCAATGCGGACC 
CCAGATTAACAAACCTCAGTTCTAACTCGGATGTCTGAGAAATATTAGAGGACAGACCAAGGACAACTCTGCATGAG 
ATGTAGACTTAAGCTTTATCCCTACTAGGCTTGCTCCACTTTCATCCTCCACTATAGATACAACGGACTTTGACTAA 
AAGCAGTGAAGGGGATTTGCTTCCTTGTTATGTAAAGTTTCTCGGTGTGTTCTGTTAATGTAAGACGATGAACAGTT 
GTGTATAGTGTTTTACCCTCTTCTTTTTCTTGGAACTCCTCAACACGTATGGAGGGATTTTTCAGGTTTCAGCATGA 
ACATGGGCTTCTTGCTGTCTGTCTCTCTCTCAGTACAGTTCAAGGTGTAGCAAGTGTACCCACACAGATAGCATTCA 
ACAAAAGCTGCCTCAACTTTTTCGAGAAAAATACTTTATTCATAAATATCAGTTTTATTCTCATGTACCTAAGTTGT 
GGAGAAAATAATTGCATCCTATAAACTGCCTGCAGACGTTAGCAGGCTCTTCAAAATAACTCCATGGTGCACAGGAG 
CACCTGCATCCAAGAGCATGCTTACATTTTACTGTTCTGCATATTACAAAAAATAACTTGCAACTTCATAACTTCTT 
TGACAAAGTAAATTACTTTTTTGATTGCAGTTTATATGAAAATGTACTGATTTTTTTTTAATAAACTGCATCGAGAT 
C C AAC CG AC T G AAT T G T T AAAAAAAAAAAAAAAT AAAG AT T C T T AAAAG AA 



58/170 



FIGURE 54 

CGGCGAGCGAGCACCTTCGACGCGGTCCGGGGACCCCCTCGTCGCTGTCCTCCCGACGCGGACCCGCGTGCCCCAGG 
CCTCGCGCTGCCCGGCCGGCTCCTCGTGTCCCACTCCCGGCGCACGCCCTCCCGCGAGTCCCGGGCCCCTCCCGCGC 
CCCTCTTCTCGGCGCGCGCGCAGCATGGCGCCCCCGCAGGTCCTCGCGTTCGGGCTTCTGCTTGCCGCGGCGACGGC 
GACTTTTGCCGCAGCTCAGGAAGAATGTGTCTGTGAAAACTACAAGCTGGCCGTAAACTGCTTTGTGAATAATAATC 
GTCAATGCCAGTGTACTTCAGTTGGTGCACAAAATACTGTCATTTGCTCAAAGCTGGCTGCCAAATGTTTGGTGATG 
AAGGCAGAAATGAATGGCTCAAAACTTGGGAGAAGAGCAAAACCTGAAGGGGCCCTCCAGAACAATGATGGGCTTTA 
TGATCCTGACTGCGATGAGAGCGGGCTCTTTAAGGCCAAGCAGTGCAACGGCACCTCCACGTGCTGGTGTGTGAACA 
CTGCTGGGGTCAGAAGAACAGACAAGGACACTGAAATAACCTGCTCTGAGCGAGTGAGAACCTACTGGATCATCATT 
GAACTAAAACACAAAGCAAGAGAAAAACCTTATGATAGTAAAAGTTTGCGGACTGCACTTCAGAAGGAGATCACAAC 
GCGTTATCAACTGGATCCAAAATTTATCACGAGTATTTTGTATGAGAATAATGTTATCACTATTGATCTGGTTCAAA 
ATTCTTCTCAAAAAACTCAGAATGATGTGGACATAGCTGATGTGGCTTATTATTTTGAAAAAGATGTTAAAGGTGAA 
TCCTTGTTTCATTCTAAGAAAATGGACCTGACAGTAAATGGGGAACAACTGGATCTGGATCCTGGTCAAACTTTAAT 
TTATTATGTTGATGAAAAAGCACCTGAATTCTCAATGCAGGGTCTAAAAGCTGGTGTTATTGCTGTTATTGTGGTTG 
TGGTGATAGCAGTTGTTGCTGGAATTGTTGTGCTGGTTATTTCCAGAAAGAAGAGAATGGCAAAGTATGAGAAGGCT 
GAGATAAAGGAGATGGGTGAGATGCATAGGGAACTCAATGC ATAA CTATATAATTTGAAGATTATAGAAGAAGGGAA 
ATAGCAAATGGACACAAATTACAAATGTGTGTGCGTGGGACGAAGACATCTTTGAAGGTCATGAGTTTGTTAGTTTA 
ACATCATATATTTGTAATAGTGAAACCTGTACTCAAAATATAAGCAGCTTGAAACTGGCTTTACCAATCTTGAAATT 
TGACCACAAGTGTCTTATATATGCAGATCTAATGTAAAATCCAGAACTTGGACTCCATCGTTAAAATTATTTATGTG 
TAACATTCAAATGTGTGCATTAAATATGCTTCCACAGTAAAATCTGAAAAACTGATTTGTGATTGAAAGCTGCCTTT 
CTATTTACTTGAGTCTTGTACATACATACTTTTTTATGAGCTATGAAATAAAACATTTTAAACTG 
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CAATTCGGCCTCGCTCCTTGTGATTGCGCTAAACCTTCCGTCCTCAGCTGAGAACGCTCCACCACCTCCCCGGATCG 
CTCATCTCTTGGCTGCCCTCCCACTGTTCCTGATGTTATTTTACTCCCCGTATCCCCTACTCGTTCTTCACAATTCT 
GTAGGTGAGTGGTTCCAGCTGGTGCCTGGCCTGTGTCTCTTGGATGCCCTGTGGCTTCAGTCCGTCTCCTGTTGCCC 
ACCACCTCGTCCCTGGGCCGCCTGATACCCCAGCCCAACAGCTAAGGTGTGGATGGACAGTAGGGGGCTGGCTTCTC 
TCACTGGTCAGGGGTCTTCTCCCCTGTCTGCCTCCCGGAGCTAGGACTGCAGAGGGGCCTATCATGGTGCTTGCAGG 
CCCCCTGGCTGTCTCGCTGTTGCTGCCCAGCCTCACACTGCTGGTGTCCCACCTCTCCAGCTCCCAGGATGTCTCCA 
GTGAGCCCAGCAGTGAGCAGCAGCTGTGCGCCCTTAGCAAGCACCCCACCGTGGCCTTTGAAGACCTGCAGCCGTGG 
GTCTCTAACTTCACCTACCCTGGAGCCCGGGATTTCTCCCAGCTGGCTTTGGACCCCTCCGGGAACCAGCTCATCGT 
GGGAGCCAGGAACTACCTCTTCAGACTCAGCCTTGCCAATGTCTCTCTTCTTCAGGCCACAGAGTGGGCCTCCAGTG 
AGGACACGCGCCGCTCCTGCCAAAGCAAAGGGAAGACTGAGGAGGAGTGTCAGAACTACGTGCGAGTCCTGATCGTC 
GCCGGCCGGAAGGTGTTCATGTGTGGAACCAATGCCTTTTCCCCCATGTGCACCAGCAGACAGGTGGGGAACCTCAG 
CCGGACTATTGAGAAGATCAATGGTGTGGCCCGCTGCCCCTATGACCCACGCCACAACTCCACAGCTGTCATCTCCT 
CCCAGGGGGAGCTCTATGCAGCCACGGTCATCGACTTCTCAGGTCGGGACCCTGCCATCTACCGCAGCCTGGGCAGT 
GGGCCACCGCTTCGCACTGCCCAATATAACTCCAAGTGGCTTAATGAGCCAAACTTCGTGGCAGCCTATGATATTGG 
GCTGTTTGCATACTTCTTCCTGCGGGAGAACGCAGTGGAGCACGACTGTGGACGCACCGTGTACTCTCGCGTGGCCC 
GCGTGTGCAAGAATGACGTGGGGGGCCGATTCCTGCTGGAGGACACATGGACCACATTCATGAAGGCCCGGCTCAAC 
TGCTCCCGCCCGGGCGAGGTCCCCTTCTACTATAACGAGCTGCAGAGTGCCTTCCACTTGCCGGAGCAGGACCTCAT 
CTATGGAGTTTTCACAACCAACGTAAACAGCATCGCGGCTTCTGCTGTCTGCGCCTTCAACCTCAGTGCTATCTCCC 
AGGCTTTCAATGGCCCATTTCGCTACCAGGAGAACCCCAGGGCTGCCTGGCTCCCCATAGCCAACCCCATCCCCAAT 
TTCCAGTGTGGCACCCTGCCTGAGACCGGTCCCAACGAGAACCTGACGGAGCGCAGCCTGCAGGACGCGCAGCGCCT 
CTTCCTGATGAGCGAGGCCGTGCAGCCGGTGACACCCGAGCCCTGTGTCACCCAGGACAGCGTGCGCTTCTCACACC 
TCGTGGTGGACCTGGTGCAGGCTAAAGACACGCTCTACCATGTACTCTACATTGGCACCGAGTCGGGCACCATCCTG 
AAGGCGCTGTCCACGGCGAGCCGCAGCCTCCACGGCTGCTACCTGGAGGAGCTGCACGTGCTGCCCCCCGGGCGCCG 
CGAGCCCCTGCGCAGCCTGCGCATCCTGCACAGCGCCCGCGCGCTCTTCGTGGGGCTGAGAGACGGCGTCCTGCGGG 
TCCCACTGGAGAGGTGCGCCGCCTACCGCAGCCAGGGGGCATGCCTGGGGGCCCGGGACCCGTACTGTGGCTGGGAC 
GGGAAGCAGCAACGTTGCAGCACACTCGAGGACAGCTCCAACATGAGCCTCTGGACCCAGAACATCACCGCCTGTCC 
TGTGCGGAATGTGACACGGGATGGGGGCTTCGGCCCATGGTCACCATGGCAACCATGTGAGCACTTGGATGGGGACA 
ACTCAGGCTCTTGCCTGTGTCGAGCTCGATCCTGTGATTCCCCTCGACCCCGCTGTGGGGGCCTTGACTGCCTGGGG 
CCAGCCATCCACATCGCCAACTGCTCCAGGAATGGGGCGTGGACCCCGTGGTCATCGTGGGCGCTGTGCAGCACGTC 
CTGTGGCATCGGCTTCCAGGTCCGCCAGCGAAGTTGCAGCAACCCTGCTCCCCGCCACGGGGGCCGCATCTTCGTGG 
GCAAGAGCCGGGAGGAACGGTTCTGTAATGAGAACACGCCTTGCCCGGTGCCCATCTTCTGGGCTTCCTGGGGCTCC 
TGGAGCAAGTGCAGCAGCAACTGTGGAGGGGGCATGCAGTCGCGGCGTCGGGCCTGCGAGAACGGCAACTCCTGCCT 
GGGCTGCGGCGAGTTCAAGACGTGCAACCCCGAGGGCTGCCCCGAAGTGCGGCGCAACACCCCCTGGACGCCGTGGC 
TGCCCGTGAACGTGACGCAGGGCGGGGCACGGCAGGAGCAGCGGTTCCGCTTCACCTGCCGCGCGCCCCTTGCAGAC 
CCGCACGGCCTGCAGTTCGGCAGGAGAAGGACCGAGACGAGGACCTGTCCCGCGGACGGCTCCGGCTCCTGCGACAC 
CGACGCCCTGGTGGAGGTCCTCCTGCGCAGCGGGAGCACCTCCCCGCACACGGTGAGCGGGGGCTGGGCCGCCTGGG 
GCCCGTGGTCGTCCTGCTCCCGGGACTGCGAGCTGGGCTTCCGCGTCCGCAAGAGAACGTGCACTAACCCGGAGCCC 
CGCAACGGGGGCCTGCCCTGCGTGGGCGATGCTGCCGAGTACCAGGACTGCAACCCCCAGGCTTGCCCAGTTCGGGG 
TGCTTGGTCCTGCTGGACCTCATGGTCTCCATGCTCAGCTTCCTGTGGTGGGGGTCACTATCAACGCACCCGTTCCT 
GCACCAGCCCCGCACCCTCCCCAGGTGAGGACATCTGTCTCGGGCTGCACACGGAGGAGGCACTATGTGCCACACAG 
GCCTGCCCAGGCTGGTCGCCCTGGTCTGAGTGGAGTAAGTGCACTGACGACGGAGCCCAGAGCCGAAGCCGGCACTG 
TGAGGAGCTCCTCCCAGGGTCCAGCGCCTGTGCTGGAAACAGCAGCCAGAGCCGCCCCTGCCCCTACAGCGAGATTC 
CCGTCATCCTGCCAGCCTCCAGCATGGAGGAGGCCACCGACTGTGCAGGTAAAAGAAACCGGACCTACCTCATGCTG 
CGGTCCTCCCAGCCCTCCAGCACCCCACTCCAAAGTCTGGACTCTTTCCACATCCTGCTCCAGACAGCCAAGCTTTG 
TTGGGGTCCCCACTGCTTTGAGATGGGTTCAATCTCATCCACTTGGTGGCCACGGGCATCTCCTGCTTCTTGGGCTC 
TGGGCTCCTGACCCTAGCAGTGTACCTGTCTTGCCAGCACTGCCAGCGTCAGTCCCAGGAGTCCACACTGGTCCATC 
CTGCCACCCCCAACCATTTGCACTACAAGGGCGGAGGCACCCCGAAGAATGAAAAGTACACACCCATGGAATTCAAG 
ACCCTGAACAAGAATAACTTGATCCCTGATGACAGAGCCAACTTCTACCCATTGCAGCAGACCAATGTGTACACGAC 
TACTTACTACCCAAGCCCCCTGAACAAACACAGCTTCCGGCCCGAGGCCTCACCTGGACAACGGTGCTTCCCCAACA 
GCTGATACCGCCGTCCTGGGGACTTGGGCTTCTTGCCTTCATAAGGCACAGAGCAGATGGAGATGGGACAGTGGAGC 
CAGTTTGGTTTTCTCCCTCTGCACTAGGCCAAGAACTTGCTGCCTTGCCTGTGGGGGGTCCCATCCGGCTTCAGAGA 
GCTCTGGCTGGCATTGACCATGGGGGAAAGGGCTGGTTTCAGGCTGACATATGGCCGCAGGTCCAGTTC 
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AGCCCAGGTCTCTCATGGTTATCTTCCAACCCACTGTCACGCTGACACTATGCTGCCATGCCTGGGCTGTGGACCTA 
CTGGGCATTTGAGGAATTGGAGAATGGAGATGGCAAGAGGGCAGGCTTTTAAGTTTGGGTTGGAGACAACTTCCTGT 
GGCCCCCACAAGCTGAGTCTGGCCTTCTCCAGCTGGCCCCAAAAAAGGCCTTTGCTACATCCTGATTATCTCTGAAA 
GTAATCAATCAAGTGGCTCCAGTAGCTCTGGATTTTCTGCCAGGGCTGGGCCATTGTGGTGCTGCCCCAGTATGACA 
TGGGACCAAGGCCAGCGCAGGTTATCCACCTCTGCCTGGAAGTCTATACTCTACCCAGGGCATCCCTCTGGTCAGAG 
GCAGTGAGTACTGGGAACTGGAGGCTGACCTGTGCTTAGAAGTCCTTTAATCTGGGCTGGTACAGGCCTCAGCCTTG 
CCCTCAATGCACGAAAGGTGGCCCAGGAGAGAGGATCAATGCCATAGGAGGCAGAAGTCTGGCCTCTGTGCCTCTAT 
GGAGACTATCTTCCAGTTGCTGCTCAACAGAGTTGTTGGCTGAGACCTGCTTGGGAGTCTCTGCTGGCCCTTCATCT 
GTTCAGGAACACACACACACACACACTCACACACGCACACACAATCACAATTTGCTACAGCAACAAAAAAGACATTG 
GGCTGTGGCATTAT T AAT T AAAG AT G AT AT C C AG T C 
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CGCAGAAAGAGGAGGCGCTTGCCTTCAGCTTGTGGGAAATCCCGAAG ATG GCCAAAGACAACTCAACTGTTCGTTGC 
TTCCAGGGCCTGCTGATTTTTGGAAATGTGATTATTGGTTGTTGCGGCATTGCCCTGACTGCGGAGTGCATCTTCTT 
TGTATCTGACCAACACAGCCTCTACCCACTGCTTGAAGCCACCGACAACGATGACATCTATGGGGCTGCCTGGATCG 
GCATATTTGTGGGCATCTGCCTCTTCTGCCTGTCTGTTCTAGGCATTGTAGGCATCATGAAGTCCAGCAGGAAAATT 
CTTCTGGCGTATTTCATTCTGATGTTTATAGTATATGCCTTTGAAGTGGCATCTTGTATCACAGCAGCAACACAACA 
AGACTTTTTCACACCCAACCTCTTCCTGAAGCAGATGCTAGAGAGGTACCAAAACAACAGCCCTCCAAACAATGATG 
ACCAGTGGAAAAACAATGGAGTCACCAAAACCTGGGACAGGCTCATGCTCCAGGACAATTGCTGTGGCGTAAATGGT 
CCATCAGACTGGCAAAAATACACATCTGCCTTCCGGACTGAGAATAATGATGCTGACTATCCCTGGCCTCGTCAATG 
CTGTGTTATGAACAATCTTAAAGAACCTCTCAACCTGGAGGCTTGTAAACTAGGCGTGCCTGGTTTTTATCACAATC 
AGGGCTGCTATGAACTGATCTCTGGTCCAATGAACCGACACGCCTGGGGGGTTGCCTGGTTTGGATTTGCCATTCTC 
TGCTGGACTTTTTGGGTTCTCCTGGGTACCATGTTCTACTGGAGCAGAATTGAATAT TAA GCATAAAGTGTTGCCAC 
CATACCTCCTTCCCCGAGTGACTCTGGATTTGGTGCTGGAACCAGCTCTCTCCTAATATTCCACGTTTGTGCCCCAC 
ACTAACGTGTGTGTCTTACATTGCCAAGTCAGATGGTACGGACTTCCTTTAGGATCTCAGGCTTCTGCAGTTCTCAT 
GACTCCTACTTTTCATCCTAGTCTAGCATTCTGCAACATTTATATAGACTGTTGAAAGGAGAATTTGAAAAATGCAT 
AATAACTACTTCCATCCCTGCTTATTTTTAATTTGGGAAAATAAATACATTCGAAGGAACCTGTGTTATCACAGTAA 
CCCAGAGCTGTATTTGGCTAGCAATCTGCCTGTATCTCTCACTATTATCTAAAAGAAACCTTCCAATGCTTCTGTTG 
ATCTCAGTATTGTCAGGGGAACAGAGAAGTTGGGAAAAGATTACTGAAATATACCTTTTGCATTTCTTTCTAGAGTA 
GCTCCCATATATGGAGATGGGTGATTCTCTTGATGCCACCTTCAGATCCTTTTATTCTCCAGAATAATTCTTAACAG 
TGGTTCAAATTTCCTTTCATACCTTGAAGTATGTGTTTAGTAGCCTCAATTCTCCATTAATTAAAAGTGTGGGCTGG 
GCGTGGGGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGTGGGCAGATCACCTGAGGTCAGGAGTTCAAG 
ACCAGCCTGGCCAACATGGTGAAACCCCGTCTCTACAAAAATACAAAAATTAGCCAGGCGTGATGGCAGGTGCCTGT 
AATCCTAGCTACTTGGCAGGCTAACGCAGGAGAATCACTTGACCGGGAGACAGAGGTTGCAGTGAGCTGAGATCGTA 
CCTATTGCACTCCATCCTGGATGAAAGAGCCAGACTCTGTCTCAAAACAAACAAAAAAGCGTGGGGACTTCTGGGGA 
CAGACAAGGTGCCTGTTATATATTTACTCAGTCTTTGCCCTGAATGGTCTCAGCTTGAGACCATTTCAAACTGGAGA 
GAAGCAAGCCAGCCAATAGAATGGGGTGATTTACAGGGATTTCTGTTTACTGTCAAAATATTTCTCATCTGCACTAT 
GTTTCCATTTGTGGTCCTGAAGGAAATTCTTATAACTCAACATTTGTCTGGTCTTATAAGTAAAGACAGCTTTAAAA 
TCTGTTCACTTTCAAA 
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CCCGGCCCGGGCTCGAGAATCAAGGGCCTCGGCCGCCGTCCCGCAGCTCAGTCCATCGCCCTTGCCGGGCAGCCCGG 
GCAGAGACC ATG TTTGACAAGACGCGGCTGCCGTACGTGGCCCTCGATGTGCTCTGCGTGTTGCTGGCTGGATTGCC 
TTTTGCAATTTTTACTTCAAGGCATATTACTTCAAGGCATACCCCCTTCCAACGAGGAGTATTCTGTAATGATGAGT 
CCATCAAGTACCCTTACAAAGAAGACACCATACCTTATGCGTTATTAGGTGGAATAATCATTCCATTCAGTATTATC 
GTTATTATTCTTGGAGAAACCCTGTCTGTTTACTGTAACCTTTTGCACTCAAATTCCTTTATCAGGAATAACTACAT 
AGCCACTATTTACAAAGCCATTGGAACCTTTTTATTTGGTGCAGCTGCTAGTCAGTCCCTGACTGACATTGCCAAGT 
ATTCAATAGGCAGACTGCGGCCTCACTTCTTGGATGTTTGTGATCCAGATTGGTCAAAAATCAACTGCAGCGATGGT 
TACATTGAATACTACATATGTCGAGGGAATGCAGAAAGAGTTAAGGAAGGCAGGTTGTCCTTCTATTCAGGCCACTC 
TTCGTTTTCCATGTACTGCATGCTGTTTGTGGCACTTTATCTTCAAGCCAGGATGAAGGGAGACTGGGCAAGACTCT 
TACGCCCCACACTGCAATTTGGTCTTGTTGCCGTATCCATTTATGTGGGCCTTTCTCGAGTTTCTGATTATAAACAC 
CACTGGAGCGATGTGTTGACTGGACTCATTCAGGGAGCTCTGGTTGCAATATTAGTTGCTGTATATGTATCGGATTT 
CTTCAAAGAAAGAACTTCTTTTAAAGAAAGAAAAGAGGAGGACTCTCATACAACTCTGCATGAAACACCAACAACTG 
GGAATCACTATCCGAGCAATCACCAGCCTTGAAAGGCAGCAGGGTGCCCAGGTGAAGCTGGCCTGTTTTCTAAAGGA 
AAATGATTGCCACAAGGCAAGAGGATGCATCTTTCTTCCTGG 
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AGTGAAGGGGTTTCCCATATGAAAAATACAGT^AGAATTATTTGAATACTAGCAAATACACAACTTGATATTTCTAG 
AGAACCCAGGCACAGTCTTGGAGACATTACTCCTGAGAGACTGCAGCTGATGGAAGATGAGCCCCAACTTCTAAAAA 
TGTATCACTACCGGGATTGAGATACAAACAGCATTTAGGAAGGTCTCATCTGAGTAGCAGCTTCCTGCCCTCCTTCT 
TGGAGATAAGTCGGGCTTTTGGTGAGACAGACTTTCCCAACCCTCTGCCCGGCCGGTGCCC ATG CTTCTGTGGCTGC 
TGCTGCTGATCCTGACTCCTGGAAGAGAACAATCAGGGGTGGCCCCAAAAGCTGTACTTCTCCTCAATCCTCCATGG 
TCCACAGCCTTCAAAGGAGAAAAAGTGGCTCTCATATGCAGCAGCATATCACATTCCCTAGCCCAGGGAGACACATA 
TTGGTATCACGATGAGAAGTTGTTGAAAATAAAACATGACAAGATCCAAATTACAGAGCCTGGAAATTACCAATGTA 
AGACCCGAGGATCCTCCCTCAGTGATGCCGTGCATGTGGAATTTTCACCTGACTGGCTGATCCTGCAGGCTTTACAT 
CCTGTCTTTGAAGGAGACAATGTCATTCTGAGATGTCAGGGGAAAGACAACAAAAACACTCATCAAAAGGTTTACTA 
CAAGGATGGAAAACAGCTTCCTAATAGTTATAATTTAGAGAAGATCACAGTGAATTCAGTCTCCAGGGATAATAGCA 
AATATCATTGTACTGCTTATAGGAAGTTTTACATACTTGACATTGAAGTAACTTCAAAACCCCTAAATATCCAAGTT 
CAAGAGCTGTTTCTACATCCTGTGCTGAGAGCCAGCTCTTCCACGCCCATAGAGGGGAGTCCCATGACCCTGACCTG 
TGAGACCCAGCTCTCTCCACAGAGGCCAGATGTCCAGCTGCAATTCTCCCTCTTCAGAGATAGCCAGACCCTCGGAT 
TGGGCTGGAGCAGGTCCCCCAGACTCCAGATCCCTGCCATGTGGACTGAAGACTCAGGGTCTTACTGGTGTGAGGTG 
GAGACAGTGACTCACAGCATCAAAAAAAGGAGCCTGAGATCTCAGATACGTGTACAGAGAGTCCCTGTGTCTAATGT 
GAATCTAGAGATCCGGCCCACCGGAGGGCAGCTGATTGAAGGAGAAAATATGGTCCTTATTTGCTCAGTAGCCCAGG 
GTTCAGGGACTGTCACATTCTCCTGGCACAAAGAAGGAAGAGTAAGAAGCCTGGGTAGAAAGACCCAGCGTTCCCTG 
TTGGCAGAGCTGCATGTTCTCACCGTGAAGGAGAGTGATGCAGGGAGATACTACTGTGCAGCTGATAACGTTCACAG 
CCCCATCCTCAGCACGTGGATTCGAGTCACCGTGAGAATTCCGGTATCTCACCCTGTCCTCACCTTCAGGGCTCCCA 
GGGCCCACACTGTGGTGGGGGACCTGCTGGAGCTTCACTGTGAGTCCCTGAGAGGCTCTCCCCCGATCCTGTACCGA 
TTTTATCATGAGGATGTCACCCTGGGGAACAGCTCAGCCCCCTCTGGAGGAGGAGCCTCCTTCAACCTCTCTCTGAC 
TGCAGAACATTCTGGAAACTACTCCTGTGATGCAGACAATGGCCTGGGGGCCCAGCACAGTCATGGAGTGAGTCTCA 
GGGTCACAGTTCCGGTGTCTCGCCCCGTCCTCACCCTCAGGGCTCCCGGGGCCCAGGCTGTGGTGGGGGACCTGCTG 
GAGCTTCACTGTGAGTCCCTGAGAGGCTCCTTCCCGATCCTGTACTGGTTTTATCACGAGGATGACACCTTGGGGAA 
CATCTCGGCCCACTCTGGAGGAGGGGCATCCTTCAACCTCTCTCTGACTACAGAACATTCTGGAAACTACTCATGTG 
AGGCTGACAATGGCCTGGGGGCCCAGCACAGTAAAGTGGTGACACTCAATGTTACAGGAACTTCCAGGAACAGAACA 
GGCCTTACCGCTGCGGGAATCACGGGGCTGGTGCTCAGCATCCTCGTCCTTGCTGCTGCTGCTGCTCTGCTGCATTA 
CGCCAGGGCCCGAAGGAAACCAGGAGGACTTTCTGCCACTGGAACATCTAGTCACAGTCCTAGTGAGTGTCAGGAGC 
CTTCCTCGTCCAGGCCTTCCAGGATAGACCCTCAAGAGCCCACTCACTCTAAACCACTAGCCCCAATGGAGCTGGAG 
CCAATGTACAGCAATGTAAATCCTGGAGATAGCAACCCGATTTATTCCCAGATCTGGAGCATCCAGCATACAAAAGA 
AAACTCAGCTAATTGTCCAATGATGCATCAAGAGCATGAGGAACTTACAGTCCTCTATTCAGAACTGAAGAAGACAC 
ACCCAGACGACTCTGCAGGGGAGGCTAGCAGCAGAGGCAGGGCCCATGAAGAAGATGATGAAGAAAACTATGAGAAT 
GTACCACGTGTATTACTGGCCTCAGACCACTAGCCCCTTACCCAGAGTGGCCCACAGGAAACAGCCTGCACCATTTT 
TTTTTCTGTTCTCTCCAACCACACATCATCCATCTCTCCAGACTCTGCCTCCTACGAGGCTGGGCTGCAGGGTATGT 
GAGGCTGAGCAAAAGGTCTGCAAATCTCCCCTGTGCCTGATCTGTGTGTTCCCCAGGAAGAGAGCAGGCAGCCTCTG 
AGCAAGCACTGTGTTATTTTCACAGTGGAGACACGTGGCAAGGCAGGAGGGCCCTCAGCTCCTAGGGCTGTCGAATA 
GAGGAGGAGAGAGAAATGGTCTAGCCAGGGTTACAAGGGCACAATCATGACCATTTGATCCAAGTGTGATCGAAAGC 
TGTTAATGTGCTCTCTGTATAAACAATTTGCTCCAAATATTTTGTTTCCCTTTTTTGTGTGGCTGGTAGTGGCATTG 
CTGATGTTTTGGTGTATATGCTGTATCCTTGCTACCATATTGGG 
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GGAGCCGCGCCGCATCTCAGGCGCAGTCTCTAGGGGCTGTGCGCATCCTAGGGGGGGACATGTGCATCTCAGGGGGG 
CTGCTCGCATCTGGGGGGTGCTGTGTGCATCTCGGGGGGGCTGTTGCATCTACGCGGGGTGGCTGTGTCCGCATCTG 
GAGGGGGCTGTGCGCAACCCGGGGGGGGTGTTGCGCGCATCTAGCAGGGGCGGCTGTGCGCATTTCGGGGGGGGGCT 
GTGCATATCTGGGGGGACACGTGCTTATCTCTGGGGGCGGCTGTGCGCATCTTGAGGGGTGTGTACATCTCGGGGGG 
CCTGTGCATCTTGGGGGGCTGTGTGCATCCGCGGGGGCTGTGCGCATCTCGGGGTGCTGTGCGCTGCTCCTCTGAGC 
TCTGCTCTTTCTTGCAGCGTTTGCCTCAGCCATGGAGGGCGGGGCCGCGGCAGCCACCCCCACAGCACTGCCTTACT 
ACGTGGCCTTCTCCCAGCTGCTGGGCCTGACCTTGGTGGCCATGACCGGCGCGTGGCTCGGGCTGTACCGAGGCGGC 
ATTGCCTGGGAGAGCGACCTGCAGTTCAACGCGCACCCCCTCTGCATGGTCATAGGCCTGATCTTCCTGCAGGGAAA 
TGCCCTGCTGGTTTACCGTGTCTTCAGGAACGAAGCTAAACGCACCACCAAGGTCCTGCACGGGCTGCTGCACATCT 
TTGCGCTCGTCATCGCCCTGGTTGGCTTGGTGGCGGTGTTCGACTACCACAGGAAGAAGGGCTACGCTGACCTGTAC 
AGCCTACACAGCTGGTGCGGGATCCTTGTCTTTGTCCTGTACTTTGTGCAGTGGCTGGTGGGCTTCAGCTTCTTCCT 
GTTCCCCGGAGCTTCATTCTCCCTGCGGAGCCGCTACCGCCCACAGCACATCTTCTTTGGTGCTACCATCTTCCTCC 
TTCCCGTGGGCACCGCCCTGCTGGGCCTGAAGGAGGCACTGCTGTTCAACCTCGGGGGCAAGTATAGCGCATTTGAG 
CCCGAGGGTGTCCTGGCCAACGTGCTGGGCCTGCTGCTGGCCTGCTTCGGTGGGGCGGTGCTCTACATCTTGACCCG 
GGCCGACTGGAAGCGGCCTTCCCAGGCGGAAGAGCAGGCCCTCTCCATGGACTTCAAGACGCTGAGGCAGGGAGATA 
GCCCCGGCTCCCAGTGATGCGCCCGGCCGGCCCTGGGGGTTCGCGGGGTGTCTTCTTGCCTGCCCCTGCTGAGGCGT 
CTTCAGGACTGCAGGCTCCGGAGAGTGGCTCTGGCAGCAGGCGGGCGCGTGGGTGCAGGGGGATCCGTTTGATGCGT 
CGTTTCTGGGGCAGGTCTCCGCCTCCTCTGCTTCTCGTTTCTCCGCTGCTATAGACCAGTTCATTGTGTGTGGCTCC 
CGTGTCTCTGTTGCCCCCTTCAGTGCAGAAGGCTTTGGGTAGACTTCGGGTGTTCGGTCCTGGTCGCAGAGCACAGA 
TCTTTAAAGAAGCGTTAGAGAGGTAGGTTCTACCCTCTTGGTAGTAGATGCCTGGGGCAAGGCCCAGGGGAAACTGG 
GGGGGCCTCAGGGACAGGCCTGGAAAGGCCACGATGGCCTGCTGAATTCAAACAAGGAGTCCCTCCAGCCTGAATAA 
CACGTGGCACAAATGGGCCCGGCCTTTGGCAGAGGAGCAAGTGATATGATGTGTAAAGTATGTTGGTGGTGAAAGCA 
AGGTTCCCCAGGAGAGGGGAGGGACTGGCCCCTGGGAAGCTGTGAGATGAGGCTGTGGCCCAGCTGTAGTCCTGACC 
TTACTCTTCTTTAAAACCCTTTAGCCCTAGGATGGCTTTGGTGGGAGAGGGGATAGAAGCCCATGACTTCAGACAGA 
CTTTCTCTTGGCAGATGCAGGCAGGCCTCCTCCCAGGCTGCTCCAGACATGGGGGTTGGGGATGGGGGGTACCTTGC 
AGCCCCTTCCTGCTGGGGCTCCCTCCTTGTAGCACCCCCTTGCGGCTCAGCTCTGGTTTCCTCTCCCAGGCTCACCC 
AGGCTCTGCTCAGGCTGGGAGGCAGAGGGCACAAACCTTATAATTTTTTAAATGAAAAACCGCTGCTGCTGGCTGTG 
GCTAGAGCCCCCTGGGGCTGCTGGAGCTGCTGCCTCTGTTCTGGAGGACGAGCCTTCTCCTTATCTGCTGCCCATCT 
TTCCAGGAAGTCAGGATGGAGTCAGACAACTAACGATCATCCCCCGTGGGTGTCTGCACATCACTCCAGCCCCATAA 
AGAGTGTCATGTTAGCTGAGTCACCATTTGGCTTCGGCCTGGAAATAGTGTGTTAGAACACTGATCGTGTGCGAGGC 
CAGGAGATCAAGACCATCCTGACTAACAAACACAGTGAAACCCCGTCTCTACTAAAAATACAAAAAAATTAGCCAGG 
CGTGGTGGTGGGCGCCTGTAGTCCCAGCTACTTGGGAGACAGGTCTCTACGGAATTCCCTGTATTAGTCTATATGGT 
TCTCCAAGAAACTGAATGAATCCATTGGAGAAGCGGTGGATAACTAGCCACGACAAAATTTGAGAATACATAAACAA 
CGCATTCCGCAGGAAACATACAGAGGATGCCTTTTCTGTGATTGGGTGGGATTTTTTCCCTTTTTATGTGGATATAG 
TAGTTACTTGTGACAAGAATAATTTTGGAATAATTTCTATTAATATCAACTCTGAAGCTAATTGTACATAATCTCGA 
GATTGTGTTTGTTCATAATAAAAGTGAAGTGAATGTG 
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CGCCACCGCTGGGTGCGGCGAGGCCGGCGCGATGCGGCAGCTGTGCCGGGGCCGCGTGCTGGGCATCTCGGTGGCCA 
TCGCGCACGGGGTCTTCTCGGGCTCCCTCAACATCTTGCTCAAGTTCCTCATCAGCCGCTACCAGTTCTCCTTCCTG 
ACCCTGGTGCAGTGCCTGACCAGCTCCACCGCGGCGCTGAGCCTGGAGCTGCTGCGGCGCCTCGGGCTCATCGCCGT 
GCCCCCCTTCGGTCTGAGCCTGGCGCGCTCCTTCGCGGGGGTCGCGGTGCTCTCCACGCTGCAGTCCAGCCTCACGC 
TCTGGTCCCTGCGCGGCCTCAGCCTGCCCATGTACGTGGTCTTCAAGCGCTGCCTGCCCCTGGTCACCATGCTCATC 
GGCGTCCTGGTGCTCAAGAACGGCGCGCCCTCGCCAGGGGTGCTGGCGGCGGTGCTCATCACCACCTGCGGCGCCGC 
CCTGGCAGGAGCCGGCGACCTGACGGGCGACCCCATCGGGTACGTCACGGGAGTGCTGGCGGTGCTGGTGCACGCTG 
CCTACCTGGTGCTCATCCAGAAGGCCAGCGCAGACACCGAGCACGGGCCGCTCACCGCGCAGTACGTCATCGCCGTC 
TCTGCCACCCCGCTGCTGGTCATCTGCTCCTTCGCCAGCACCGACTCCATCCACGCCTGGACCTTCCCGGGCTGGAA 
GGACCCGGCCATGGTCTGCATCTTCGTGGCCTGCATCCTGATCGGCTGCGCCATGAACTTCACCACGCTGCACTGCA 
CCTACATCAATTCGGCCGTGACCACCTCTCTGTTCATTGCCGGCGTGGTGGTGAACACCCTGGGCTCTATCATTTAC 
TGTGTGGCCAAGTTCATGGAGACCAGAAAGCAAAGCAACTACGAGGACCTGGAGGCCCAGCCTCGGGGAGAGGAGGC 
GCAGCTAAGTGGAGACCAGCTGCCGTTCGTGATGGAGGAGCTGCCCGGGGAGGGAGGAAATGGCCGGTCAGAAGGTG 
GGGAGGCAGCAGGTGGCCCCGCTCAGGAGAGCAGGCAAGAGGTCAGGGGCAGCCCCCGAGGAGTCCCGCTGGTGGCT 
GGGAGCTCTGAAGAAGGGAGCAGGAGGTCGTTAAAAGATGCTTACCTCGAGGTATGGAGGTTGGTTAGGGGAACCAG 
GTATATGAAGAAGGATTATTTGATAGAAAACGAGGAGTTACCCAGTCCTTGAGAAGGAGGTGCATGTACGTACCTAT 
GTGCATACACTTATTTTATATGTTAGAAATGACGTGTTTTAATGAGAGGCCTCCCCGTTTTATTCTTTGAGGAGTGG 
GGAAGGGAAGAAAAGAAAGAAGCTGAAAGGTACTGACACAGAGCAACAAAATTAGCACCTGTGTGAATTATTTAGTG 
TGACTTCACCTGAGGCATCACAGAGACAAAAGAATGTGAAGCTACTTAACAAAGTAAGGCAACGTTTCTGCTTCAGA 
CTCCTGGCACATTTACTTTTTGTCATTATAACCATAACTAAATATCTGCATGTACCAAGAGTCCCTAAGCCACCCCC 
TCCAAAGATGGAGTGTAGAAATGATGACAGCACTTAGTAAGTTCAAAGATGACATTCAGGGATGCATTTTTTGATGA 
TAGAACTACAGTTTTTATCGCCAGCTGGGCAAAGAGTATATTGCTGAAATGATATATAAATATATTGAATTGATGTT 
TACTGTTTATAGTCATCTGAAATATCATATTTACTCTGATTCTACTCACTTGTTTTTTAAAAATAAGTGTCCTACTA 
TTGTATTATATATTGATAGAAACTGTTAAAGCTATTTTGAAAATATGAGTTCTTAGCTTTAATCATGAAGTCTGAAG 
TTTGCTTTCAGTAATTATTTTAAAAGTTGTTTTGGTTCATTGCTTTATAATATTTATTATTGAATGCCAAACCTGTT 
CTTTTTTTTACTGTGTCCAATATTCTTTCAAGCAAATGCAATGGCTGGAATATAATTCAGAATTAACTGAAACCCAG 
CCAGAAGAGGGACCACCTGTAAAGCAAGTCCTTTCAAGTTTCACTGCACATCCCAAACCATGTTACAAAAAGAGCAA 
CTGCTATATTCACATTATGATATTTTTCTATCTTAAATTTGTCAAAATAAAGTATGAGTCTAACTATTAAAAAAAAA 
AAAAAAAAA 
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ATTTCTCCAGGAGAACTCCACACTTGCACAATATCTTCTCCAAGGACAGAAGAGGGACAATATTTGTCCTTCCTACT 
GGATTTTGAAACCCTTTGCACTGAGTGTAGACTGTAGTTCTGTGACATACCATGAAGAGTGTGTGTGTGTTTTAAGG 
GAGCTACTGTCTTACCCAAAACCTGTGAATATAAAGTGTTTTTTCATGAATTGCTCATTATTCAGCCAGTCGTTAAT 
GAATTCATTCAACAAGTGTCTCTGAGATGCTAGACACTGGGGATTAAAAGAGGAACAACAGAGACAAGATCTCTGCC 
CTCCAGAAACTGACAGTCTATTGAATGAGACAGTTGTCTAACAATCACAATCAAGTGTGATCAATCTCTGGTGACAG 
GACTCTAACCTAGAGGCATGTGCCTAATCTGGGGTGACT 
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CAGGGGACAGGCTGCAGCCGGTGCAGTTACACGTTTTCCTCCAAGGAGCCTCGGACGTTGTCACGGGTTTGGGGTCG 
GGGACAGAGCAGTGACCATGGCCAGGCTGGCGTTGTCTCCTGTGCCCAGCCACTGGATGGTGGCGTTGCTGCTGCTG 
CTCTCAGCTGAGCCAGTACCAGCAGCCAGATCGGAGGACCGGTACCGGAATCCCAAAGGTAGTGCTTGTTCGCGGAT 
CTGGCAGAGCCCACGTTTCATAGCCAGGAAACGGGGCTTCACGGTGAAAATGCACTGCTACATGAACAGCGCCTCCG 
GCAATGTGAGCTGGCTCTGGAAGCAGGAGATGGACGAGAATCCCCAGCAGCTGAAGCTGGAAAAGGGCCGCATGGAA 
GAGTCCCAGAACGAATCTCTCGCCACCCTCACCATCCAAGGCATCCGGTTTGAGGACAATGGCATCTACTTCTGTCA 
GCAGAAGTGCAACAACACCTCGGAGGTCTACCAGGGCTGCGGCACAGAGCTGCGAGTCATGGGATTCAGCACCTTGG 
CACAGCTGAAGCAGAGGAACACGCTGAAGGATGGTATCATCATGATCCAGACGCTGCTGATCATCCTCTTCATCATC 
GTGCCTATCTTCCTGCTGCTGGACAAGGATGACAGCAAGGCTGGCATGGAGGAAGATCACACCTACGAGGGCCTGGA 
CATTGACCAGACAGCCACCTATGAGGACATAGTGACGCTGCGGACAGGGGAAGTGAAGTGGTCTGTAGGTGAGCACC 
CAGGCCAGGAGTGAGAGCCAGGTCGCCCCATGACCTGGGTGCAGGCTCCCTGGCCTCAGTGACTGCTTCGGAGCTGC 
CTGGCTCATGGCCCAACCCCTTTCCTGGACCCCCCAGCTGGCCTCTGAAGCTGGCCCACCAGAGCTGCCATTTGTCT 
CCAGCCCCTGGTCCCCAGCTCTTGCCAAAGGGCCTGGAGTAGAAGGACAACAGGGCAGCAACTTGGAGGGAGTTCTC 
TGGGGATGGACGGGACCCAGCCTTCTGGGGGTGCTATGAGGTGATCCGTCCCCACACATGGGATGGGGGAGGCAGAG 
ACTGGTCCAGAGCCCGCAAATGGACTCGGAGCCGAGGGCCTCCCAGCAGAGCTTGGGAAGGGCCATGGACCCAACTG 
GGCCCCAGAAGAGCCACAGGAACATCATTCCTCTCCCGCAACCACTCCCACCCCAGGGAGGCCCTGGCCTCCAGTGC 
CTTCCCCCGTGGAATAAACGGTGTGTCCTGAGAAACCA 
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ATGGGCCTCCCCGAGCCGGGCCCTCTCCGGCTTCTGGCGCTGCTGCTGCTGCTGCTGCTGCTGCTGCTGCTGCGGCT 
CCAGCATCTTGCGGCGGCAGCGGCTGATCCGCTGCTCGGCGGCCAAGGGCCGGCCAAGGAGTGCGAAAAGGACCAAT 
TCCAGTGCCGGAACGAGCGCTGCATCCCCTCTGTGTGGAGATGCGACGAGGACGATGACTGCTTAGACCACAGCGAC 
GAGGACGACTGCCCCAAGAAGACCTGTGCAGACAGTGACTTCACCTGTGACAACGGCCACTGCATCCACGAACGGTG 
GAAGTGTGACGGCGAGGAGGAGTGTCCTGATGGCTCCGATGAGTCCGAGGCCACTTGCACCAAGCAGGTGTGTCCTG 
CAGAGAAGCTGAGCTGTGGACCCACCAGCCACAAGTGTGTACCTGCCTCGTGGCGCTGCGACGGGGAGAAGGACTGC 
GAGGGTGGAGCGGATGAGGCCGGCTGTGCTACCTCACTGGGCACCTGCCGTGGGGACGAGTTCCAGTGTGGGGATGG 
GACATGTGTCCTTGCAATCAAGCACTGCAACCAGGAGCAGGACTGTCCAGATGGGAGTGATGAAGCTGGCTGCCTAC 
AGGGGCTGAACGAGTGTCTGCACAACAATGGCGGCTGCTCACACATCTGCACTGACCTCAAGATTGGCTTTGAATGC 
ACGTGCCCAGCAGGCTTCCAGCTCCTGGACCAGAAGACTTGTGGCGACATTGATGAGTGCAAGGACCCAGATGCCTG 
CAGCCAGATCTGTGTCAATTACAAGGGCTATTTTAAGTGTGAGTGCTACCCTGGCTGCGAGATGGACCTACTGACCA 
AGAACTGCAAGGCTGCTGCTGGCAAGAGCCCATCCCTAATCTTCACCAACCGCACGAGTGCGGAGGATCGACCTGTG 
AAGCGGAACTATTCACGCCTCATCCCCATGCTCAAGAATGTCGTGGCACTAGATGTGGAAGTTGCCACCAATCGCAT 
CTACTGGTGTGACCTCTCCTACCGTAAGATCTATAGCGCCTACATGGACAAGGCCAGTGACCCGAAAGAGCGGGAGG 
TCCTCATTGACGAGCAGTTGCACTCTCCAGAGGGCCTGGCAGTGGACTGGGTCCACAAGCACATCTACTGGACTGAC 
TCGGGCAATAAGACCATCTCAGTGGCCACAGTTGATGGTGGCCGCCGACGCACTCTCTTCAGCCGTAACCTCAGTGA 
ACCCCGGGCCATCGCTGTTGACCCCCTGCGAGGGTTCATGTATTGGTCTGACTGGGGGGACCAGGCCAAGATTGAGA 
AATCTGGGCTCAACGGTGTGGACCGGCAAACACTGGTGTCAGACAATATTGAATGGCCCAACGGAATCACCCTGGAT 
CTGCTGAGCCAGCGCTTGTACTGGGTAGACTCCAAGCTACACCAACTGTCCAGCATTGACTTCAGTGGAGGCAACAG 
AAAGACGCTGATCTCCTCCACTGACTTCCTGAGCCACCCTTTTGGGATAGCTGTGTTTGAGGACAAGGTGTTCTGGA 
CAGACCTGGAGAACGAGGCCATTTTCAGTGCAAATCGGCTCAATGGCCTGGAAATCTCCATCCTGGCTGAGAACCTC 
AACAACCCACATGACATTGTCATCTTCCATGAGCTGAAGCAGCCAAGAGCTCCAGATGCCTGTGAGCTGAGTGTCCA 
GCCTAATGGAGGCTGTGAATACCTGTGCCTTCCTGCTCCTCAGATCTCCAGCCACTCTCCCAAGTACACATGTGCCT 
GTCCTGACACAATGTGGCTGGGTCCAGACATGAAGAGGTGCTACCGAGATGCAAATGAAGACAGTAAGATGGGCTCA 
ACAGTCACTGCCGCTGTTATCGGGATCATCGTGCCCATAGTGGTGATAGCCCTCCTGTGCATGAGTGGATACCTGAT 
CTGGAGAAACTGGAAGCGGAAGAACACCAAAAGCATGAATTTTGACAACCCAGTCTACAGGAAAACAACAGAAGAAG 
AAGATGAAGATGAGCTCCATATAGGGAGAACTGCTCAGATTGGCCATGTCTATCCTGCACGAGTGGCATTAAGCCTT 
GAAGATGATGGACTACCCTGAGGATGGGATCACCCCCTTCGTGCCTCATGGAATTCAGTCCCATGCACTACACTCCG 
GATGGTGTATGACTGGATGAATGGGTTTCTATATATGGGTCTGTGTGAGTGTATGTGTGTGTGTGATTTTTTTTTTT 
AAATTTATGTTGCGGAAAGGTAACCACAAAGTTATGATGAACTGCAAACATCCAAAGGATGTGAGAGTTTTTCTATG 
TATAATGTTTTATACACTTTTTAACTGGTTGCACTACCCATGAGGAATTCGTGGAATGGCTACTGCTGACTAACATG 
ATGCACATAACCAAATGGGGGCCAATGGCACAGTACCTTACTCATCATTTAAAAACTATATTTACAGAAGATGTTTG 
GTTGCTGGGGGGCTTTTTTAGGTTTTGGGCATTTGTTTTTTGTAAATAAGATGATTATGCTTTGTGGCTATCCATCA 
ACATAAGT 
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CCGTTCCGCGCTCTGGCGGCTCCTCCCGGGCGATGCCTCCGCTCTGGGCCCTGCTGGCCCTCGGCTGCCTGCGGTTC 
GGCTCGGCTGTGAACCTGCAGCCCCAACTGGCCAGTGTGACTTTCGCCACCAACAACCCCACACTTACCACTGTGGC 
CTTGGAAAAGCCTCTCTGCATGTTTGACAGCAAAGAGGCCCTCACTGGCACCCACGAGGTCTACCTGTATGTCCTGG 
TCGACTCAGCCATTTCCAGGAATGCCTCAGTGCAAGACAGCACCAACACCCCACTGGGCTCAACGTTCCTACAAACA 
GAGGGTGGGAGGACAGGTCCCTACAAAGCTGTGGCCTTTGACCTGATCCCCTGCAGTGACCTGCCCAGCCTGGATGC 
CATTGGGGATGTGTCCAAGGCCTCACAGATCCTGAATGCCTACCTGGTCAGGGTGGGTGCCAACGGGACCTGCCTGT 
GGGATCCCAACTTCCAGGGCCTCTGTAACGCACCCCTGTCGGCAGCCACGGAGTACAGGTTCAAGTATGTCCTGGTC 
AATATGTCCACGGGCTTGGTAGAGGACCAGACCCTGTGGTCGGACCCCATCCGCACCAACCAGCTCACCCCATACTC 
GACGATCGACACGTGGCCAGGCCGGCGGAGCGGAGGCATGATCGTCATCACTTCCATCCTGGGCTCCCTGCCCTTCT 
TTCTACTTGTGGGTTTTGCTGGCGCCATTGCCCTCAGCCTCGTGGACATGGGGAGTTCTGATGGGGAAACGACTCAC 
GACTCCCAAATCACTCAGGAGGCTGTTCCCAAGTCGCTGGGGGCCTCGGAGTCTTCCTACACGTCCGTGAACCGGGG 
GCCGCCACTGGACAGGGCTGAGGTGTATTCCAGCAAGCTCCAAGACTGAGCCCAGCACCACCCCTGGGCAGCAGCAT 
CCTCCTCTCTGGCCTTGCCCCAGGCCCTGCAGCGGTGGTTGTCACACCCTGACTTCAGGGAAGGTGAAACAGGGCTT 
GTCCCTCCAACTGCAGGAAAACCCTTAATAAAATCTTCTGATGAGTTCTAAAAAAAAA 
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TCTCTGACCACCGGTGCATGCAGCCCCTGTCACATACCGCCTGCTTGCTCAAATCAATCATGACCCTTTCATGTGAA 
ATCTTTAGTATTGTGAGCCCTTAAAAGGGACGGAAATTGTGCATACGTGGAGCTCGGATTTTAAGGCAGTAGCTGCC 
CGATGCTCCCAGCTGAATAAAGCCCTTCCTTCTACAATTTGGTGTCTGAGGGGTTTTGTCTGCGGCTCGTCCTGCTA 
CATTTCTTGGTTCCCTGACCGGGAAGCAAGGTGACTGACAGACGGCCGAGGCAGCCCCTTAGGCGGCTTAAGCCTGC 
CCTGTGGAGCATCCCTGCGGGGGACTCCGGCCAGCCTGAGTGACGCGATCCAAAGAGTGCTCCCGGGTAGGAAATTG 
CCCCGGTGGAACGCCTCGCCAGAGCAGCGTGTAGCAGGCCCCCGCGGAGGATTAACACAGTGGCTGAACACCGGGAA 
GGAACTGGCACTTGGAGTCCAGACATCTGAAACTTGACTGGGAGCTGTACGTGGATGGGAGCAGCTTCACCAACCCC 
TGCAAAGTGACTCTGAAGAAGACGACAAGCCCTGCTCCAGTCACACCCGGAAGCTGACTGGTCCACGCACGGCCGAA 
GC ATG AGGAAACTCATCGCAGGACTCATTTTTCTTAAATTTTGGACTTATACAGTAAGGGCTTCAACTGACCTTCCT 
CAGACTGAGAACTGTTTCCAGTATATACATCAAGTCACTGAGATCTCCAGCACCCTGCCGGTGGCACTACTGAGAGA 
CGAGGTGCCAGGGTGGTTCCTGAAAGTGCCTGAGCCCCAACTTATCAGCAAGGAGCTCATCATGCTGACAGAAGTCA 
TGGAGGTCTGGCATGGCTTAGTGATCGCGGTGGTGTCCCTCTTCCTGCAGGCCTGCTTCCTCACCGCCATCAACTAC 
CTGCTCAGCAGGCACATGGCCCACAAGAGTGAACAGATACTGAAAGCGGCCAGTCTCCAGGTTCCCAGGCCCAGCCC 
TGGCCACCATCATCCACCTGCTGTCAAAGAGATGAAGGAGACTCAGACAGAGAGAGACATCCCAATGTCTGATTCCC 
TTTACAGGCATGACAGCGACACACCCTCAGATAGCTTGGATAGCTCCTGCAGTTCGCCTCCTGCCTGCCAGGCCACA 
GAGGATGTGGATTACACACAAGTCGTCTTTTCTGACCCTGGAGAACTAAAATGACTCCCCGCTGGACTATGAGAACA 
TAAAGGAAATCACAGATTATGTCAATGTCAATCCAGAAAGACACAAGCCCAGTTTCTGGTATTTTGTCAACCCTGCT 
TCTTGTCTGAGCCAGCGGAATTATGATCAAGTGGCCATGTGAATTCCAAATATTTTTTAAATGGGGTCCAGTTCTCT 
ATGGNTTTCTTANAATTTAATTTTGTAGGGGNAANTGCCATTTTNCCCCTTTTAAACAANGGNTTGGGGNTAAAAGN 
TTTTTNGGGCCA 
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AAACTTTTGCAGCGGCTGGGTAGCAGCACGTCTCTTGCTCCTCAGGGCCACTGCCAGGCTTGCCGAGTCCTGGGACT 
GCTCTCGCTCCGGCTGCCACTCTCCCGCGCTCTCCTAGCTCCCTGCGAAGCAGG ATG GCCGGGACCGTGCGCACCGC 
GTGCTTGGTGGTGGCGATGCTGCTCAGCTTGGACTTCCCGGGACAGGCGCAGCCCCCGCCGCCGCCGCCGGACGCCA 
CCTGTCACCAAGTCCGCTCCTTCTTCCAGAGACTGCAGCCCGGACTCAAGTGGGTGCCAGAAACTCCCGTGCCAGGA 
TCAGATTTGCAAGTATGTCTCCCTAAGGGCCCAACATGCTGCTCAAGAAAGATGGAAGAAAAATACCAACTAACAGC 
ACGATTGAACATGGAACAGCTGCTTCAGTCTGCAAGTATGGAGCTCAAGTTCTTAATTATTCAGAATGCTGCGGTTT 
TCCAAGAGGCCTTTGAAATTGTTGTTCGCCATGCCAAGAACTACACCAATGCCATGTTCAAGAACAACTACCCAAGC 
CTGACTCCACAAGCTTTTGAGTTTGTGGGTGAATTTTTCACAGATGTGTCTCTCTACATCTTGGGTTCTGACATCAA 
TGTAGATGACATGGTCAATGAATTGTTTGACAGCCTGTTTCCAGTCATCTATACCCAGCTAATGAACCCAGGCCTGC 
CTGATTCAGCCTTGGACATCAATGAGTGCCTCCGAGGAGCAAGACGTGACCTGAAAGTATTTGGGAATTTCCCCAAG 
CTTATTATGACCCAGGTTTCCAAGTCACTGCAAGTCACTAGGATCTTCCTTCAGGCTCTGAATCTTGGAATTGAAGT 
GATCAACACAACTGATCACCTGAAGTTCAGTAAGGACTGTGGCCGAATGCTCACCAGAATGTGGTACTGCTCTTACT 
GCCAGGGACTGATGATGGTTAAACCCTGTGGCGGTTACTGCAATGTGGTCATGCAAGGCTGTATGGCAGGTGTGGTG 
GAGATTGACAAGTACTGGAGAGAATACATTCTGTCCCTTGAAGAACTTGTGAATGGCATGTACAGAATCTATGACAT 
GGAGAACGTACTGCTTGGTCTCTTTTCAACAATCCATGATTCTATCCAGTATGTCCAGAAGAATGCAGGAAAGCTGA 
CCACCACTATTGGCAAGTTATGTGCCCATTCTCAACAACGCCAATATAGATCTGCTTATTATCCTGAAGATCTCTTT 
ATTGACAAGAAAGTATTAAAAGTTGCTCATGTAGAACATGAAGAAACCTTATCCAGCCGAAGAAGGGAACTAATTCA 
GAAGTTGAAGTCTTTCATCAGCTTCTATAGTGCTTTGCCTGGCTACATCTGCAGCCATAGCCCTGTGGCGGAAAACG 
ACACCCTTTGCTGGAATGGACAAGAACTCATGGAGAGATACAGCCAAAAGGCAGCAAGGAATGGAATGAAAAACCAG 
TTCAATCTCCATGAGCTGAAAATGAAGGGCCCTGAGCCAGTGGTCAGTCAAATTATTGACAAACTGAAGCACATTAA 
CCAGCTCCTGAGAACCATGTCTATGCCCAAAGGTAGAGTTCTGGATAAAAACCTGGATGAGGAAGGGTTTGAAAGTG 
GAGACTGCGGTGATGATGAAGATGAGTGCATTGGAGGCTCTGGTGATGGAATGATAAAAGTGAAGAATCAGCTCCGC 
TTCCTTGCAGAACTGGCCTATGATCTGGATGTGGATGATGCGCCTGGAAACAGTCAGCAGGCAACTCCGAAGGACAA 
CGAGATAAGCACCTTTCACAACCTCGGGAACGTTCATTCCCCGCTGAAGCTTCTCACCAGCATGGCCATCTCGGTGG 
TGTGCTTCTTCTTCCTGGTGCACTGACTGCCTGGTGCCCAGCACATGTGCTGCCCTACAGCACCCTGTGGTCTTCCT 
CGATAAAGGGAACCACTTTCTTATTTTTTTCTATTTTTTTTTTTTTGTTATCCTGTATACCTCCTCCAGCCATGAAG 
TAGAGGACTAACCATGTGTTATGTTTTCGAAAATCAAATGGTATCTTTTGGAGGAAGATACATTTTAGTGGTAGCAT 
ATAGATTGTCCTTTTGCAAAGAAAGAAAAAAAACCATCAAGTTGTGCCAAATTATTCTCCTATGTTTGGCTGCTAGA 
ACATGGTTACCATGTCTTTCTCTCTCACTCCCTCCCTTTCTATCGTTCTCTCTTTGCATGGATTTCTTTGAAAAAAA 
ATAAATTGCTCAAATA 
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GCGGAACACCGGCCCGCCGTCGCGGCAGCTGCTTCACCCCTCTCTCTGCAGCCATGGGGCTCCCTCGTGGACCTCTC 
GCGTCTCTCCTCCTTCTCCAGGTTTGCTGGCTGCAGTGCGCGGCCTCCGAGCCGTGCCGGGCGGTCTTCAGGGAGGC 
TGAAGTGACCTTGGAGGCGGGAGGCGCGGAGCAGGAGCCCGGCCAGGCGCTGGGGAAAGTATTCATGGGCTGCCCTG 
GGCAAGAGCCAGCTCTGTTTAGCACTGATAATGATGACTTCACTGTGCGGAATGGCGAGACAGTCCAGGAAAGAAGG 
TCACTGAAGGAAAGGAATCCATTGAAGATCTTCCCATCCAAACGTATCTTACGAAGACACAAGAGAGATTGGGTGGT 
TGCTCCAATATCTGTCCCTGAAAATGGCAAGGGTCCCTTCCCCCAGAGACTGAATCAGCTCAAGTCTAATAAAGATA 
GAGACACCAAGATTTTCTACAGCATCACGGGGCCGGGGGCAGACAGCCCCCCTGAGGGTGTCTTCGCTGTAGAGAAG 
GAGACAGGCTGGTTGTTGTTGAATAAGCCACTGGACCGGGAGGAGATTGCCAAGTATGAGCTCTTTGGCCACGCTGT 
GTCAGAGAATGGTGCCTCAGTGGAGGACCCCATGAACATCTCCATCATCGTGACCGACCAGAATGACCACAAGCCCA 
AGTTTACCCAGGACACCTTCCGAGGGAGTGTCTTAGAGGGAGTCCTACCAGGTACTTCTGTGATGCAGGTGACAGCC 
ACAGATGAGGATGATGCCATCTACACCTACAATGGGGTGGTTGCTTACTCCATCCATAGCCAAGAACCAAAGGACCC 
ACACGACCTCATGTTCACAATTCACCGGAGCACAGGCACCATCAGCGTCATCTCCAGTGGCCTGGACCGGGAAAAAG 
TCCCTGAGTACACACTGACCATCCAGGCCACAGACATGGATGGGGACGGCTCCACCACCACGGCAGTGGCAGTAGTG 
GAGATCCTTGATGCCAATGACAATGCTCCCATGTTTGACCCCCAGAAGTACGAGGCCCATGTGCCTGAGAATGCAGT 
GGGCCATGAGGTGCAGAGGCTGACGGTCACTGATCTGGACGCCCCCAACTCACCAGCGTGGCGTGCCACCTACCTTA 
TCATGGGCGGTGACGACGGGGACCATTTTACCATCACCACCCACCCTGAGAGCAACCAGGGCATCCTGACAACCAGG 
AAGGGTTTGGATTTTGAGGCCAAAAACCAGCACACCCTGTACGTTGAAGTGACCAACGAGGCCCCTTTTGTGCTGAA 
GCTCCCAACCTCCACAGCCACCATAGTGGTCCACGTGGAGGATGTGAATGAGGCACCTGTGTTTGTCCCACCCTCCA 
AAGTCGTTGAGGTCCAGGAGGGCATCCCCACTGGGGAGCCTGTGTGTGTCTACACTGCAGAAGACCCTGACAAGGAG 
AATCAAAAGATCAGCTACCGCATCCTGAGAGACCCAGCAGGGTGGCTAGCCATGGACCCAGACAGTGGGCAGGTCAC 
AGCTGTGGGCACCCTCGACCGTGAGGATGAGCAGTTTGTGAGGAACAACATCTATGAAGTCATGGTCTTGGCCATGG 
ACAATGGAAGCCCTCCCACCACTGGCACGGGAACCCTTCTGCTAACACTGATTGATGTCAACGACCATGGCCCAGTC 
CCTGAGCCCCGTCAGATCACCATCTGCAACCAAAGCCCTGTGCGCCACGTGCTGAACATCACGGACAAGGACCTGTC 
TCCCCACACCTCCCCTTTCCAGGCCCAGCTCACAGATGACTCAGACATCTACTGGACGGCAGAGGTCAACGAGGAAG 
GTGACACAGTGGTCTTGTCCCTGAAGAAGTTCCTGAAGCAGGATACATATGACGTGCACCTTTCTCTGTCTGACCAT 
GGCAACAAAGAGCAGCTGACGGTGATCAGGGCCACTGTGTGCGACTGCCATGGCCATGTCGAAACCTGCCCTGGACC 
CTGGAAAGGAGGTTTCATCCTCCCTGTGCTGGGGGCTGTCCTGGCTCTGCTGTTCCTCCTGCTGGTGCTGCTTTTGT 
TGGTGAGAAAGAAGCGGAAGATCAAGGAGCCCCTCCTACTCCCAGAAGATGACACCCGTGACAACGTCTTCTACTAT 
GGCGAAGAGGGGGGTGGCGAAGAGGACCAGGACTATGACATCACCCAGCTCCACCGAGGTCTGGAGGCCAGGCCGGA 
GGTGGTTCTCCGCAATGACGTGGCACCAACCATCATCCCGACACCCATGTACCGTCCTAGGCCAGCCAACCCAGATG 
AAATCGGCAACTTTATAATTGAGAACCTGAAGGCGGCTAACACAGACCCCACAGCCCCGCCCTACGACACCCTCTTG 
GTGTTCGACTATGAGGGCAGCGGCTCCGACGCCGCGTCCCTGAGCTCCCTCACCTCCTCCGCCTCCGACCAAGACCA 
AGATTACGATTATCTGAACGAGTGGGGCAGCCGCTTCAAGAAGCTGGCAGACATGTACGGTGGCGGGGAGGACGACT 
AGGCGGCCTGCCTGCAGGGCTGGGGACCAAACGTCAGGCCACAGAGCATCTCCAAGGGGTCTCAGTTCCCCCTTCAG 
CTGAGGACTTCGGAGCTTGTCAGGAAGTGGCCGTAGCAACTTGGCGGAGACAGGCTATGAGTCTGACGTTAGAGTGG 
TTGCTTCCTTAGCCTTTCAGGATGGAGGAATGTGGGCAGTTTGACTTCAGCACTGAAAACCTCTCCACCTGGGCCAG 
GGTTGCCTCAGAGGCCAAGTTTCCAGAAGCCTCTTACCTGCCGTAAAATGCTCAACCCTGTGTCCTGGGCCTGGGCC 
TGCTGTGACTGACCTACAGTGGACTTTCTCTCTGGAATGGAACCTTCTTAGGCCTCCTGGTGCAACTTAATTTTTTT 
TTTTAATGCTATCTTCAAAACGTTAGAGAAAGTTCTTCAAAAGTGCAGCCCAGAGCTGCTGGGCCCACTGGCCGTCC 
TGCATTTCTGGTTTCCAGACCCCAATGCCTCCCATTCGGATGGATCTCTGCGTTTTTATACTGAGTGTGCCTAGGTT 
GCCCCTTATTTTTTATTTTCCCTGTTGCGTTGCTATAGATGAAGGGTGAGGACAATCGTGTATATGTACTAGAACTT 
T T T T AT T AAAG AAA 
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GCGCCTTGCCCGCGGCCCCGCCGGGCCTCAGGCCCGGGGACCCTCAGCGCTACCTTTTGCCTCCGGCTCTGGGACTG 
CTCGTGGCCGCGGGCGTGGCCGGCGCTGCGCTCTTGCTGGTCCACGTGCGCCGCCGTGGCCACTCCCAGGATGCTGG 
GTCTCGCTTGCTGGCTGGGACCCCGGAGCCGTCAGTCCACGCACTCCCGGATGCACTCAACAACCTAAGGACGCAGG 
AGGGTTCCGGGGATGGTCCGAGCTCGTCCGTAGATTGGAATCGCCCTGAAGATGTAGACCCTCAAGGGATTTATGTC 
ATATCTGCTCCTTCCATCTACGCTCGGGAGGTAGCGACGCCCCTTTTCCCCCCGCTACACACTGGGCGCGCTGGGCA 
GAGGCAGCACCTGCTTTTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 
XXXXXXXXXXXXXCGGGCGTGGCCGGCGCTGCGCTCTTGNTGGTCCANGTGCGCNGCCGTGGCCAATCNCAGGATGC 
TGGGTCNCGCTTNCTGGTTNGGACCCCGGNGCCGTNAGTCCACNCACTCCNNGATGCACTCAACAACCTAAGGACGC 
AGGAGGGTTCCCGGGGATGGTCCGAGCTCGTCCGTAGATTGGAATCGCCCTGAAGATGTAGACCCTCACAGGGAGTG 
TATGTAATXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTCCGCACGTGGAGTCAGAG 
CGTGGATTTTTGTANNTGCTNGGTGGTGCCCAGTCTCTGCCCCNGAGGCTTTGGANTTCAATCTTGAAGGGGTGTNC 
TGGGGGAACTTTACTGTTGCAAGTTGTAAATAATGGTTNTTTATATCGTNTTTTTTCGTCACCCCATCTCTGATAGA 
AACACCTNTAAAGGNTATTATTGTGAGTCAAAAAAA 
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GCGGCGACTGCGGCGACCGCGGGACGGCGAGAGGCACGCGGCGGGAGGGGACCGGAATCCGCAGCTCCGGCCGCGCC 
ATGGACGGCAACGACAACGTGACCCTGCTCTTCGCCCCTCTGCTGCGGGACAACTACACCCTGGCGCCCAATGCCAG 
CAGCCTGGGCCCCGGCACGGACCTCGCCCTCGCCCCTGCCTCCAGCGCCGGCCCCGGCCCTGGGCTCAGCCTCGGGC 
CGGGTCCGAGCTTCGGCTTCAGCCCCGGCCCCACTCCGACCCCGGAGCCCACGACCAGCGGCCTCGCGGGCGGCGCG 
GCGAGCCACGGCCCTTCCCCGTTCCCTCGGCCCTGGGCGCCCCACGCGCTCCCGTTCTGGGACACGCCGCTGAACCA 
CGGGCTGAACGTGTTCGTGGGCGCCGCCCTGTGCATCACCATGCTGGGCCTGGGCTGCACGGTGGACGTGAACCACT 
TCGGGGCGCACGTCCGTCGGCCCGTGGGCGCGCTGCTGGCAGCGCTCTGCCAGTTCGGCCTCCTGCCGCTGCTGGCC 
TTCCTGCTGGCCCTCGCCTTCAAGCTGGACGAGGTGGCCGCCGTGGCGGTGCTCCTGTGTGGCTGCTGTCCCGGCGG 
CAATCTCTCCAATCTTATGTCCCTGCTGGTTGACGGCGACATGAACCTCAGCATCATCATGACCATCTCCTCCACGC 
TTCTGGCCCTCGTCTTGATGCCCCTGTGCCTGTGGATCTACAGCTGGGCTTGGATCAACACCCCTATCGTGCAGTTA 
CTACCCCTAGGGACCGTGACCCTGACTCTCTGCAGCACTCTCATACCTATCGGGTTGGGCGTCTTCATTCGCTACAA 
ATACAGCCGGGTGGCTGACTACATTGTGAAGGTTTCCCTGTGGTCTCTGCTAGTGACTCTGGTGGTCCTTTTCATAA 
TGACCGGCACTATGTTAGGACCTGAACTGCTGGCAAGTATCCCTGCAGCTGTTTATGTGATAGCAATTTTTATGCCT 
TTGGCAGGCTACGCTTCAGGTTATGGTTTAGCTACTCTCTTCCATCTTCCACCCAACTGCAAGAGGACTGTATGTCT 
GGAAACAGGTAGTCAGAATGTGCAGCTCTGTACAGCCATTCTAAAACTGGCCTTTCCACCGCAATTCATAGGAAGCA 
TGTACATGTTTCCTTTGCTGTATGCACTTTTCCAGTCTGCAGAAGCGGGGATTTTTGTTTTAATCTATAAAATGTAT 
GGAAGTGAAATGTTGCACAAGCGAGATCCTCTAGATGAAGATGAAGATACAGATATTTCTTATAAAAAACTAAAAGA 
AGAGGAAATGGCAGACACTTCCTATGGCACAGTGAAAGCAGAAAATATAATAATGATGGAAACCGCTCAGACTTCTC 
TCTAAATGTGGAGATACACAGGAGCTTCTATCTTGCTGAAATATTGCTTCATATTTATAGCCTGTGGTAGTGCACAT 
GGTTAACATAAAAGATAACACTGGTTCACATCATACATGTAACAATTCTGATCTTTTTAAGGTTCACTGGTGTATTA 
ACCAAACGTTGTCACAAATTACAAATCAATGCTGTAATATAATTTGCACCTGGAATGGCTAACGTGAAGCCTGAATT 
AAATGTGGTTTTTAGTTTTTACCATCACCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 70 

CGGCCAGCACACCCCGGCACCTCCTCTGCGGCAGCTGCGCCTCGCAAGCGCAGTGCCGCAGCGCACGCCGGAGTGGC 
TGTAGCTGCCTCGGCGCGGCTGCCGCCCTGCGCGGGCTGTGGGCTGCGGGCTGCGCCCCCGCTGCTGGCCAGCTCTG 
CACGGCTGCGGGCTCTGCGGCGCCCGGTGCTCTGCAACGCTGCGGCGGGCGGCATGGGATAACGCGGCC ATG GTGCG 
CCGAGATCGCCTCCGCAGGATGAGGGAGTGGTGGGTCCAGGTGGGGCTGCTGGCCGTGCCCCTGCTTGCTGCGTACC 
TGCACATCCCACCCCCTCAGCTCTCCCCTGCCCTTCACTCATGGAAGTCTTCAGGCAAGTTTTTCACTTACAAGGGA 
CTGCGTATCTTCTACCAAGACTCTGTGGGTGTGGTTGGAAGTCCAGAGATAGTTGTGCTTTTACACGGTTTTCCAAC 
ATCCAGCTACGACTGGTACAAGATTTGGAAGGGTCTGACCTTGAGGTTTCATCGGGTGATTGCCCTTGATTTCTTAG 
GCTTTGGCTTCAGTGACAAACCGAGACCACATCACTATTCCATATTTGAGCAGGCCAGCATCGTGGAAGCGCTTTTG 
CGGCATCTGGGGCTCCAGAACCGCAGAATCAACCTTCTTTCTCATGACTATGGAGATATTGTTGCTCAGGAGCTTCT 
CTACAGGTACAAGCAGAATCGATCTGGTCGGCATACCATAAAGAGTCTCTGTCTGTCAAATGGAGGTATCTTTCCTG 
AGACTCACCGTCCACTCCTTCTCCAAAAGCTACTCAAAGATGGAGGTGTGCTGTCACCCATCCTCACACGACTGATG 
AACTTCTTTGTATTCTCTCGAGGTCTCACCCCAGTCTTTGGGCCGTATACTCGGCCCTCTGAGAGTGAGCTGTGGGA 
CATGTGGGCAGGGATCCGCAACAATGACGGGAACTTAGTCATTGACAGTCTCTTACAGTACATCAATCAGAGGAAGA 
AGTTCAGAAGGCGCTGGGTGGGAGCTCTTGCCTCTGTAACTATCCCCATTCATTTTATCTATGGGCCATTGGATCCT 
GTAAATCCCTATCCAGAGTTTTTGGAGCTGTACAGGAAAACGCTGCCGCGGTCCACAGTGTCGATTCTGGATGACCA 
CATTAGCCACTATCCACAGCTAGAGGATCCCATGGGCTTCTTGAATGCATATATGGGCTTCATCAACTCCTTCTGAG 
CTGGAAAGAGTAGCTTCCCTGTATTACCTCCCCTACTCCCTTATGTGTTGTGTATTCCACTTAGGAAGAAATGCCCA 
AAAGAGGTCCTGGCCATCAAACATAATTCTCTCACAAAGTCCACTTTACTCAAATTGGTGAACAGTGTATAGGAAGA 
AGCCAGCAGGAGCTCTGACTAAGGTTGACATAATAGTCCACCTCCCATTACTTTGATATCTGATCAAATGTATAGAC 
TTGGCTTTGTTTTTTGTGCTATTAGGAAATTCTGATGAGCATTACTATTCACTGATGCAGAAAGACGTTCTTTTGCA 
TAAAAGACTTTTTTTTAACACTTTGGACTTCTCTGAAATATTTAGAAGTGCTAATTTCTGGCCCACCCCCAACAGGA 
ATTCTATAGTAAGGAGGAGGAGAAGGGGGGCTCCTTCCCTCTCCTCGAATGACGTTATGGGCACATGCCTTTTAAAA 
GTTCTTTAAGCAACACAGAGCTGAGTCCTCTTTGTCATACCTTTGGATTTAGTGTTTCATCAGCTGTTTTTAGTTAT 
AAACATTTTGTTAAAATAGATATTGGTTTAAATGATACAGTATTTTAGGTATGATTTAAGACTATGATTTACCTATA 
CATTATATATATTTTATAAAGATACTAAACCAGCATACCCTTACTCTGCCAGAGTAGTGAAGCTAATTAAACACGTT 
TGGTTTCTGAATAAATTGAACTAAATCCAAACTATTTCCTAAAATCACAGGACATTAAGGACCAATAGCATCTGTGC 
CAGAGATGTACTGTTATTAGCTGGGAAGACCAATTCTAACAGCAAATAACAGTCTGAGACTCCTCATACCTCAGTGG 
TTAGAAGCATGTCTCTCTTGAGCTACAGTAGAGGGGAAGGGATTGTTGTGTAGTCAAGTCACCATGCTGAATGTACA 
CTGATTCCTTTATGATGACTGCTTAACTCCCCACTGCCTGTCCCAGAGAGGCTTTCCAATGTAGCTCAGTAATTCCT 
GTTACTTTACAGACAGGAAAGTTCCAGAAACTTTAAGAACAAACTCTGAAAGACCTATGAGCAAATGGTGCTGAATA 
CTTTTTTTTTAAAGCCACATTTCATTGTCTTAGTCAAAGCAGGATTATTAAGTGATTATTTAAAATTCGTTTTTTTA 
AATTAGCAACTTCAAGTATAACAACTTTGAAACTGGAATAAGTGTTTATTTTCTATTAATAAAAATGAATTGTGACA 
AAAAAAAAACCG 
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CGGGTGGACAAGTCACCTGTCTAACTGTGGTGAGAGCAACAGGCCTCCAAAGGAAAGAAGTTGTTTCCGAGTTTGTG 
ACTGGCACAGTGACCTCTTTCAGTGGGAGGTTTCTGACTGGCACCACTGTGTGCTTGTTCCTTACGCTCGCGGTGAA 
GTCAAGCCTCGGACTGCAGAGTGTGTGACGGCTCAGCATGGACTGCAGCACCGGATGGTGCGCTGCATTCAGAAGCT 
GAACCGAACTGTGGTTGCAAATGAAATATGCGAACACTTTGCCCTTCAGCCTCCTACAGAACAGGCTTGCCTCATTC 
CTTGTCCCCGGGATTGTGTAGTATCTGAGTTCTTACCATGGTCCAACTGTAGCAAGGGATGTGGGAAGAAATTGCAG 
CATAGAACTCGCGCGGTCATAGCTCCCCCTCTCTTTGGTGGTTTGCAATGTCCAAATCTGACTGAGTCAAGAGCCTG 
TGATGCTCCCATTTCCTGTCCTCTTGGGGAAGAGGAATATACATTTAGCCTTAAGGTTGGACCATGGAGTAAATGCA 
GACTGCCTCATCTTAAAGAAATTAATCCAAGCGGAAGAACTGTTCTGGATTTTAACTCTGATTCAAATGAGCGAGTC 
ACCTTTAAACATCAAAGTTACAAAGCACATCATCATTCGAAGTCTTGGGCAATAGAGATAGGTTATCAAACCCGGCA 
GGTTTCGTGTACAAGAAGTGATGGACAAAATGCTATGTTAAGCCTTTGCCTTCAAGATTCCTTCCCATTGACTGTTC 
AGTCCTGCATCATGCCCAAAGACTGTGAAACCTCCCAGTGGTCCTCCTGGAGCCCCTGCTCCAAGACATGCCGTTCA 
GGGAGTCTCTTGCCAGGATTTAGGAGCAGGAGCCGGAACGTGAAGCACATGGCTATTGGAGGTGGAAAGGAGTGTCC 
TGAACTTCTTGAGAAAGAGGCCTGCATTGTTGAAGGAGAACTTCTGCAGCAATGTCCCAGGTATTCCTGGAGAACTT 
CTGAATGGAAAGAATGCCAAGTCTCTCTCCTCCTCGAGCAGCAGGATCCCCACTGGCATGTGACGGGACCCGTGTGT 
GGCGGTGGGATCCAGACCCGGGAGGTGTACTGTGCCCAGAGCGTACCAGCAGCTGCCGCACTGAGGGCCAAGGAAGT 
CTCTAGACCTGTGGAAAAGGCATTATGTGTGGGACCCGCCCCGTTGCCCTCTCAGCTCTGCAATATCCCTTGCTCTA 
CGGACTGCATAGTATCTTCCTGGTCAGCCTGGGGCCTGTGCATCCATGAAAACTGTCATGAACCTCAGGGGAAAAAA 
GGATTTAGAACGAGGCAGCGCCATGTCCTCATGGAATCTACAGGGCCTGCAGGGCATTGCCCTCATTTGGTGGAGTC 
TGTTCCTTGTGAGGATCCAATGTGCTACCGATGGCTGGCATCAGAAGGGATCTGTTTCCCTGATCATGGAAAATGTG 
GCCTGGGACATCGTATTCTGAAGGCCGTCTGCCAGAATGACCGCGGAGAAGATGTATCAGGGAGTCTTTGCCCAGTT 
CCCCCTCCTCCTGAGAGGAAGTCTTGTGAAATTCCCTGCCGAATGGACTGTGTGCTGAGCGAGTGGACGGAGTGGTC 
ATCCTGTTCCCAGTCCTGTTCAAATAAAAACTCAGATGGGAAACAGACCAGGTCAAGAACTATCCTGGCACTGGCTG 
GGGAAGGTGGAAAGCCATGTCCCCCTAGTCAGGCTCTCCAAGAGCATCGTTTGTGTAATGACCATTCCTGTATGCAG 
CTTCACTGGGAGACATCGCCTTGGGGCCCTTGTTCTGAGGACACATTGGTAACTGCCCTTAATGCAACCATTGGCTG 
GAATGGAGAAGCCACGTGTGGTGTAGGCATTCAGACTCGGAGAGTCTTCTGTGTCAAGAGTCACGTGGGACAAGTAA 
TGACCAAAAGATGTCCAGATTCTACTCGACCTGAAACTGTGCGCCCCTGTTTTCTCCCATGCAAAAAAGACTGTATT 
GTGACTGCTTTCAGTGAGTGGACACCCTGCCCAAGGATGTGCCAAGCAGGAAATGCCACAGTAAAACAGTCTCGATA 
CAGAATCATCATCCAAGAAGCAGCCAATGGAGGCCAGGAATGCCCAGATACCTTATATGAGGAGAGAGAGTGTGAAG 
ATGTTTCCTTGTGTCCTGTATATCGGTGGAAGCCACAGAAATGGAGCCCTTGCATCTTAGTGCCAGAGTCTGTCTGG 
CAGGGAATAACGGGCAGCAGTGAAGCCTGTGGAAAGGGGTTACAAACAAGAGCTGTCTCATGCATCTCTGATGACAA 
CCGGTCAGCAGAAATGATGGAATGCCTCAAGCAGACAAACGGCATGCCTCTCCTTGTGCAAGAATGCACAGTCCCAT 
GTCGAGAAGACTGCACCTTCACTGCTTGGTCCAAGTTTACGCCCTGCTCCACGAACTGTGAAGCCACAAAAAGTAGG 
CGGCGACAGCTCACAGGGAAAAGCAGAAAGAAGGAGAAATGCCAGGATTCTGACCTTTACCCTCTAGTGGAGACAGA 
ACTATGTCCTTGTGATGAATTTATATCCCAACCTTATGGAAACTGGTCAGATTGCATTCTTCCAGAAGGCAGAAGGG 
AGCCTCACCGAGGACTGCGGGTACAAGCAGACAGCAAAGAATGTGGAGAAGGCCTGCGCTTTCGAGCAGTAGCCTGT 
TCTGATAAAAATGGAAGACCTGTTGACCCCTCCTTCTGCAGCAGCTCTGGTTACATTCAAGAAAAATGTGTCATTCC 
CTGCCCATTTGATTGCAAGTTAAGCGATTGGTCTAGTTGGGGGTCTTGCAGTTCATCTTGTGGAATTGGAGTGAGAA 
TTCGATCCAAATGGCTAAAAGAAAAACCTTACAATGGAGGACGACCATGTCCCAAACTGGATCTCAAGAATCAGGCT 
CAGGTACATGAGGCAGTCCCATGTTACAGTGAGTGCAATCAGTATTCCTGGGTTGTAGAACACTGGTCTTCATGCAA 
AATCAACAATGAGCTGAGGTCCCTGCGCTGTGGAGGAGGAACACAATCTAGGAAAATCAGATGTGTGAATACTGCGG 
ATGGTGAAGGTGGAGCAGTGGATAGCAACCTGTGCAACCAGGATGAAATTCCCCCAGAAACCCAGTCCTGTTCTCTT 
ATGTGTCCCAATGAGTGTGTCATGTCTGAGTGGGGACTTTGGAGCAAATGCCCACAGTCATGCGATCCCCACACAAT 
GCAGAGAAGAACTCGCCACCTGCTAAGACCATCACTGAACTCAAGGACTTGTGCTGAAGACTCACAGGTGCAGCCTT 
GCCTCCTGAATGAAAATTGCTTCCAGTTCCAGTACAATCTAACAGAGTGGAGCACATGCCAGCTGAGTGAAAACGCA 
CCCTGTGGTCAAGGCGTCAGGACCCGCCTGCTAAGCTGTGTGTGCAGTGATGGCAAGCCAGTCAGCATGGACCAATG 
TGAGCAGCATAATTTGGAGAAGCCCCAGAGAATGAGCATTCCCTGCTTGGTGGAATGCGTGGTCAACTGTCAGCTCT 
CAGGGTGGACGGCTTGGACAGAGTGTTCACAGACCTGTGGCCATGGAGGTCGAATGAGCCGGACTCGATTTATCATT 
ATGCCAACCCAAGGAGAAGGACGGCCATGCCCCACAGAGCTTACCCAGGAGAAAACCTGCCCAGTGACCCCCTGCTA 
CAGCTGGGTCCTTGGCAACTGGTCTGCATGTAAATTGGAGGGTGGAGACTGTGGGGAAGGAGTTCAGATCCGCAGCC 
TTTCCTGCATGGTCCACAGTGGTTCAATATCTCATGCAGCTGGACGTGTCGAGGATGCACTGTGTGGAGAAATGCCC 
TTTCAGGACAGCATCCTGAAGCAGCTGTGTTCTGTGCCTTGCCCAGGAGACTGCCATTTAACAGAATGGTCAGAGTG 
GAGCACATGTGAATTAACCTGCATTGATGGAAGAAGCTTTGAGACTGTGGGCCGCCAGTCTAGATCAAG 
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GACTTTTATAATTCAGTCTTTTGAGAACCAAGACAGCTGCCCCCAACAGGTTCTAGAAACACGCCCTTGTACAGGAG 
GCAAATGTTATCACTACACATGGAAAGCAAGTCTTTGGAACAATAACGAACGAACTGTATGGTGCCAGCGTTCAGAT 
GGCGTTAATGTCACAGGAGGCTGCTCCCCTCAGGCCCGTCCTGCTGCCATTCGGCAGTGCATTCCAGCCTGCAGAAA 
ACCTTTCTCCTACTGTACACAGGGTGGAGTCTGTGGTTGTGAGAAGGGCTATACAGAGATAATGAGATCAAATGGTT 
TCCTGGATTACTGCATGAAAGTACCAGGCTCAGAGGATAAAAAAGCTGATGTGAAAAACCTTTCTGGGAAAAACAGA 
CCTGTGAATTCAAAAATACATGATATTTTTAAAGGATGGTCTCTTCAACCACTTGATCCAGATGGCCGAGTAAAAAT 
TTGGGTTTATGGCGTTTCAGGTGGCGCTTTTCTCATCATGATTTTCCTAATATTTACTTCCTACCTTGTTTGCAAGA 
AGCCAAAACCACATCAAAGCACACCTCCCCAACAGAAGCCTCTGACCTTAGCCTACGATGGAGACTTAGACATGTAA 
TCTGAAAAAGAAATCCAAATGTAGACATCAACTGCCTTAACCGCTTTCTCTTTTGTAGCTCTCAGACTTCTCAGTTT 
TTTGAGGAATCTCAAGATGTGATATATTGGGCAGAATACAAATATTGCAAAAGTAATATTGCCTCAACTTCATTTGG 
ACATGGAGTCAAGGATTATTAGGTCTGCCATTTTGTTTTCAAGTTGTTTGTGGGTGTGTTTTATTTTTTTGGTTTTC 
CCAAGGGACCTGAAAACCCTTCTCTTCCTGTTTGGAACTGGGAGGAAGAAAACATGATGGAATTCCCACAGACTTGA 
GTAAACTTGATCTTCAGCAGCATAATGACAATCCAGAGGAAAATACAGTCAAGGCATTTACTCTAAATGACGAGTCT 
GACACTGCATTGTTACGCACTATATTAGTGCAAAGTCTTTATTCTTCCCATACTTCAACACTGAGTTTTCTAGAGTT 
TACTTTGGTTTAAAGACTTTCAAATTGGATTGCCTATTTTTCATGAACACAGAGAGAATGGATTACCATTTCAGAAA 
TTCTCTGAGTTTTTAACCTTTAAATATTGTATTTTGTTTTGTAGCCAGGGGATGATGGCGCTTCATGGGTTGCAGCT 
ACTGAAAATAGCAGCGTGTGTGTAATTGCTGGACTAGATGAAAGCTAGGTCATTTCTGAAGGGAATGTGTACTGAAT 
GTTAGAGTGTACAAATGAAATATGTGGTTAAATTGGAGAATGAGGTAGATTATTTGATTACTAAAACTGTATTTTAA 
CAAAAACTTATCCATGTAGATATAGCATTAACCACACACAGTTGTAATTCAGTTTAATGATGACAAACTCTGCTTTT 
GTAATTTCAATTTTCTTATCTGAATATTTATAAATTCTTCTTTCAAATTTAATTATCTGACCTCATTTAATATACAT 
CAAACACCGATCCTGTTTGTACAAAGTCTTGCTTTTATAAGGTTTCAATAATATCTAAAACAACACATTAAAAAGCT 
GAGACCATTTTATGAAGATAATTGTTTGTAATCATAGGTGTTGAAAGTAAAAAGGTGCCATCTTGTGGTATTGACTT 
GTATTTATAACAAATAAACTGCTCAAGAGACTGC 
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CCCACCTCGGCCACACTGCTTCGCTCCCCCCTCCCCACCTTCAGCTGGCACCGCAAACAAGCCTTACCTCGTTGCAT 
CTGCGAGGAGAGGTAGCAACAGCGAGCCTAGCCAGCCAGAGGCGGTGGAGAGGAGGAAGGGGCGGGGTGGGGGGGCA 
GAGAGCGGGCCGAGGCCGCCCTTGGTGGGGGTAGCGGGGGCAGAGCTGCCGAGCAGACCCGGCAGCCGCCCTCCTCC 
GCCCCCACCCTCCAGAAGCACTCTTGCCTGAGAACCGAATTATTCCACTAGTATTATTTCATTTTTTATTACTCCCC 
CTCCTCACTCCCCAGCCGCCCCCACCCCCACGCCGGCCTCGCCTCTGGTTGCATGGCAGCGCTGCCCGGGCGCGGGG 
GCTCAGGGCTGGCCCCCAGGGAAGGGGGAGGAGGAGGAGGATCATGAAGGCCGGAGTCGCGACCGCGCCGGACGGCG 
GGCAGCAGCCAGAGGACGAGCCGGAGCAGCCCCCGCCCCGGAGACACCCGGACGCCGAGCAGCAGTCGCCTCCGCCG 
CCGCAGCAGCAGCTGCGGGCGCGGGCCGACCCGGAGCCGGAGGAGGAGGGCGACCGCGACCCAGAGGAGGAAGAGGA 
GGAGAAGGCGAGGCCACTCGCCCGCCCGGACCTGCCCTTCTCTCGGCTCTTGCCCCTCGACCGAAGGGACCTTTGAT 
GGAACCGAGGGGAGGGCGCCACGGATTTGCCGACTGCAGCAGGGGTGGGCTGGGGGCTGAGATAATGTAACCACTCC 
TTTCTCCTGTTCTCTCCCACACGCCCCTCTCCTCTACCCCTATTCTCTGCTCCACTGCCCTCTCACCCCGGTACACA 
CACCCTTCCTCTAGCCAGGATCTTCATGCTCAGGAAGGAGGCGCCTCTGCAAGGGTTAAACGATCTTTTCTTTTTCT 
CCCATCCTTTTTCCTTCCCAAACCTCTATTTTACCTCCCTTCTCCTAATTGGCTTTCCCCTCTTCGGTGTAACCCTT 
TGGCTGCGGAGGCAAAGCACAAGCCCCTAGCCCAGTTTCACCTGCAACACCCCTCCCCCACCCAACTGCTCTCTTAA 
AAGCAACTCTGGTGCTTCTGGGGGTTAATTGCCCCAGTTTTCTGCCCAGGAGAATTAAAACTTCTCCCAATCTTCTC 
TCCTCCCCTACCTTGACTCCCCCAACCCCTACCACCTGAGAAAAACGATCTTTTCTCTCTCACACATGCAGTCCTCA 
ATTCTTCATTGAGCTAGTTTTCTCTAAGCCCAGCTCAATCCACTCCAGATTTGATTTACAATTGTCCCCACCCTTTT 
ATATAAAAGAAAGATTTCTCACTGCGTAGGAATTTGAGAAGAACCCAATAATCCTTTCCTGGGGAACTTTTAAACAA 
TTCGACATTGATTTAAACAATTCGACAGAGGCTCTAGTGGCCCCTCCACCACGCTTCTTCAATCCCTCTGCTCCTAC 
CAGTGTCCTCAGGTCAAAAGCAGAAAGGAGACACACTGAACAAAGTTGGAGGTTGGGGTGGGTGTGTGAGGGCAAGA 
AAAACTTTTTTGTTATTGGGCTTTCCAGGTGGAGTTCAGAACCAGTGACTCACACTTCTCAGTCCTGGGAGCAATTT 
ATTTGCTACTTGGAGGGGTTGTAAGAAAAGCCAGTGAGAAAGCAGACTCCCCCCACAACACAGATCCACTGTGGACC 
CCCAAAACCTGTCCTGTCCCCCTCTTTTAAGACTCCAGCCACCCCTCTTGGGCTCTCTACTTCCACGGGGCACATGC 
TGATGCCCCTGTGTGGGCTGCTCTGGTGGTGGTGGTGCTGCTGCTCCGGCTGGTACTGCTATGGATTGTGTGCCCCA 
GCCCCCCAGATGTTGCGCCACCAGGGTCTCCTCAAGTGCCGCTGCCGCATGCTCTTCAATGACCTGAAGGTTTTCTT 
ACTGCGGCGCCCTCCTCAAGCGCCCCTGCCCATGCACGGCGACCCCCAGCCCCCCGGTTTGGCGGCCAACAACACCC 
TTCCGGCTCTGGGCGCCGGGGGGTGGGCAGGCTGGAGGGGCCCCCGAGAAGTGGTGGGCAGGGAGCCCCCTCCTGTG 
CCACCTCCACCCCCCTTGCCACCTTCTTCTGTGGAAGATGACTGGGGTGGCCCAGCCACAGAGCCACCTGCCTCGCT 
GCTCAGCAGTGCCTCCTCAGATGACTTCTGTAAGGAGAAGACCGAGGATCGCTACTCACTGGGCAGCAGCTTGGACA 
GTGGTATGAGGACCCCACTCTGCCGCATCTGCTTCCAGGGGCCAGAACAGGGGGAGCTGCTGAGCCCATGCCGCTGT 
GATGGCTCGGTCAAGTGCACACACCAGCCTTGCCTCATCAAGTGGATCAGCGAGCGGGGCTGCTGGAGCTGCGAGCT 
GTGCTACTACAAGTACCACGTCATCGCCATAAGCACAAAAAATCCTCTGCAGTGGCAGGCCATCTCTCTGACGGTCA 
TTGAGAAGGTTCAGGTTGCAGCCGCCATCCTGGGCTCCCTCTTCCTCATCGCCAGTATTTCTTGGCTCATCTGGTCA 
ACTTTCAGCCCCTCGGCAAGATGGCAGCGCCAAGACCTTCTCTTCCAGATCTGCTACGGGATGTATGGCTTCATGGA 
CGTGGTGTGCATAGGTCTCATCATCCATGAAGGACCCTCGGTGTACCGCATCTTTAAACGGTGGCAGGCTGTCAACC 
AGCAGTGGAAAGTGCTGAACTATGACAAGACAAAAGACCTGGAGGATCAAAAGGCAGGAGGCAGGACCAACCCCCGG 
ACCTCCTCATCCACCCAGGCCAATATCCCCTCCTCGGAAGAGGAGACCGCAGGCACCCCTGCCCCTGAGCAGGGCCC 
TGCCCAGGCTGCCGGCCACCCCTCAGGCCCTCTGTCCCATCACCACTGTGCTTATACCATCCTGCACATCCTGAGTC 
ACTTGAGACCTCATGAACAGCGAAGTCCCCCAGGCAGCAGCCGAGAGCTGGTCATGAGAGTCACGACAGTG TGA GAG 
CAGAGGCCCGGAAGGAAGGCCATGACCACCACTGAGGGCCCGGAGCAGGGTGGGGAGGTGCAGTGGCACCCCCGGAG 
CCAACAGAGGGAGCAGGCAGAGGGTGGGGGACCTGGCGGGAGCCCTGGGGTAGTGTCAGAGCGGGAGTGAGGCTGGT 
GCAGGAGCAGTTCTGCTATTTCCAATCAGTCAATGCCACTCTCCACAACAACAATGAAAACCAACACCAACTCAACA 
ACAAAGTGCAATACAGGCTGAACCTGGCCCAACAGAAAAACCCTGCCCCAATGCACCTGCAGGCAAGGTACCCGAAG 
AAGCAGAGGCTGAGGGCAGGCAAAGCCTGTGTGACTGTGGCAGTGCCGGAGGCCAAGGGGGCCAAGAGGAAAAGCAT 
CTGTGGTCTGCCCTGCTCTCACCCTGTTTGGTTTTGTTTCTCCTGGGGCTGTGTTCTGCAGGCAGCCAGAAAAGGAG 
GAGGCACGGGTGAGCTGGCAGGGACACACTGCCTTTGGGGCTCCTGGGCTCATTTGGATGAGCAAGATTCGCTGACA 
AATGGCTGTGGGGATGGTGGGGTGGATGGTCAGGGAGGGATCCTCAGGGAGGGATATGCTGGTGTGAGCAGCCAGAG 
GGAGAGTGTGTCTCCTTCCTGAAGGAACTTCCAAATGGAACTCCCGATTTCAGGTGGGCTAAAAGAGGGCTTAGGTT 
TGGAAAAGGGTGTCCTTCTGTGCCCTTGTTAATTTATTTTATAGTGATTTGGTTCAAAGATGTTTACAGGACACACA 
CACACACACACACACACACACACACACACACACACCCCTAGAGAAAAGTACAGATTTCCAGTGGATATTTCAAGCAC 
AGTTCTGCTGCTGTGGCTTCAGCTTTGGAAGCTGTCAATCCCGGAGCAACTTTCCCAACTACCCAACCCCACCATGG 
CCAGGACATGTGCAATGCCAGCCCCTTCTTGTCTTGGCACATGCACAGACCCAGTCCCCTCACGGTAGG 
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GCACCCCTGACCTACGGGCTTCCAAGAGAGCAGCTGCAGTGGTTGGGAGGAGCTTGACCAGTGTGCCCCAAGGAGTG 
GAGTAGAGCCCAATCTAAGTATTCCTTGCTGCTTGGAACCCTCCCTGTTTGGAACCCTCCCCAAAGAGGCAGTCAGG 
CTGATGCTCAGTGCTTTGTGCTCCCTGCTCCTTCCCGCGTAGCCAGGTGGGCCCAAGGGTGCCTGGCAGGGAGCACT 
ACCCCTGGACCCCTCCTGCTCGCTCTGGGGACCCTGCCAGGGAAGGCCACTGGGTGTTCACCTGCAAAGTTTCTGGT 
TGTCACTGCACAGTGGTCGCGTCATCCATGGGTATTAAAAGGACACTGTCAAGTACTTTTTTAAACTAGTTTTTAGG 
GTTTTTTAAAACTCTCTGTTGTTTGTAATATTCTCTTAAAAGCTTGAAAATAAAACTTCTTTCCCTACC 



80/170 

FIGURE 73 



CAGGAGCAAGGGGACAAGATGATGGAGGAATACAGCCTGGAGAAAAATGAGAGGGCCTGCATTGATTTCGCCATCAG 
CGCCAAGCCGCTGACCCGACACATGCCGCAGAACAAGCAGAGCTTCCAGTACCGCATGTGGCAGTTCGTGGTGTCTC 
CGCCTTTCGAGTACACGATCATGGCCATGATCGCCCTCAACACCATCGTGCTTATGATGAAGTTCTATGGGGCTTCT 
GTTGCTTATGAAAATGCCCTGCGGGTGTTCAACATCGTCTTCACCTCCCTCTTCTCTCTGGAATGTGTGCTGAAAGT 
CATGGCTTTTGGGATTCTGAATTATTTCCGCGATGCCTGGAACATCTTCGACTTTGTGACTGTTCTGGGCAGCATCA 
CCGATATCCTCGTGACTGAGTTTGGGAATAACTTCATCAACCTGAGCTTTCTCCGCCTCTTCCGAGCTGCCCGGCTC 
ATCAAACTTCTCCGTCAGGGTTACACCATCCGCATTCTTCTCTGGACCTTTGTGCAGTCCTTCAAGGCCCTGCCTTA 
TGTCTGTCTGCTGATCGCCATGCTCTTCTTCATCTATGCCATCATTGGGATGCAGGTGTTTGGTAACATTGGCATCG 
ACGTGGAGGACGAGGACAGTGATGAAGATGAGTTCCAAATCACTGAGCACAATAACTTCCGGACCTTCTTCCAGGCC 
CTCATGCTTCTCTTCCGGAGTGCCACCGGGGAAGCTTGGCACAACATCATGCTTTCCTGCCTCAGCGGGAAACCGTG 
TGATAAGAACTCTGGCATCCTGACTCGAGAGTGTGGCAATGAATTTGCTTATTTTTACTTTGTTTCCTTCATCTTCC 
TCTGCTCGTTTCTGATGCTGAATCTCTTTGTCGCCGTCATCATGGACAACTTTGAGTACCTCACCCGAGACTCCTCC 
ATCCTGGGCCCCCACCACCTGGATGAGTACGTGCGTGTCTGGGCCGAGTATGACCCCGCAGCTTGCGGTCGGATTCA 
TTATAAGGATATGTACAGTTTATTACGAGTAATATCTCCCCCTCTCGGCTTAGGCAAGAAATGTCCTCATAGGGTTG 
CTTGCAAGCGGCTTCTGCGGATGGACCTGCCCGTCGCAGATGACAACACCGTCCACTTCAATTCCACCCTCATGGCT 
CTGATCCGCACAGCCCTGGACATCAAGATTGCCAAGGGAGGAGCCGACAAACAGCAGATGGACGCTGAGCTGCGGAA 
GGAGATGATGGCGATTTGGCCCAATCTGTCCCAGAAGACGCTAGACCTGCTGGTCACACCTCACAAGTCCACGGACC 
TCACCGTGGGGAAGATCTACGCAGCCATGATGATCATGGAGTACTACCGGCAGAGCAAGGCCAAGAAGCTGCAGGCC 
ATGCGCGAGGAGCAGGACCGGACACCCCTCATGTTCCAGCGCATGGAGCCCCCGTCCCCAACGCAGGAAGGGGGACC 
TGGCCAGAACGCCCTCCCCTCCACCCAGCTGGACCCAGGAGGAGCCCTGATGGCTCACGAAAGCGGCCTCAAGGAGA 
GCCCGTCCTGGGTGACCCAGCGTGCCCAGGAGATGTTCCAGAAGACGGGCACATGGAGTCCGGAACAAGGCCCCCCT 
ACCGACATGCCCAACAGCCAGCCTAACTCTCAGTCCGTGGAGATGCGAGAGATGGGCAGAGATGGCTACTCCGACAG 
CGAGCACTACCTCCCCATGGAAGGCCAGGGCCGGGCTGCCTCCATGCCCCGCCTCCCTGCAGAGAACCAGACCATCT 
CAGACACCAGCCCCATGAAGCGTTCAGCCTCCGTGCTGGGCCCCAAGGCCCGACGCCTGGACGATTACTCGCTGGAG 
CGGGTCCCGCCCGAGGAGAACCAGCGGCACCACCAGCGGCGCCGCGACCGCAGCCACCGCGCCTCTGAGCGCTCCCT 
GGGCCGCTACACCGATGTGGACACAGGCTTGGGGACAGACCTGAGCATGACCACCCAATCCGGGGACCTGCCGTCGA 
AGGAGCGGGACCAGGAGCGGGGCCGGCCCAAGGATCGGAAGCATCGACAGCACCACCACCACCACCACCACCACCAC 
CACCCCCCGCCCCCCGACAAGGACCGCTATGCCCAGGAACGGCCGGACCACGGCCGGGCACGGGCTCGGGACCAGCG 
CTGGTCCCGCTCGCCCAGCGAGGGCCGAGAGCACATGGCGCACCGGCAGGGCAGTAGTTCCGTAAGTGGAAGCCCAG 
CCCCCTCAACATCTGGTACCAGCACTCCGCGGCGGGGCCGCCGCCAGCTCCCCCAGACCCCCTCCACCCCCCGGCCA 
CACGTGTCCTATTCCCCTGTGATCCGTAAGGCCGGCGGCTCGGGGCCCCCGCAGCAGCAGCAGCAGCAGCAGCAGCA 
GCAGCAGCAGGCGGTGGCCAGGCCGGGCCGGGCGGCCACCAGCGGCCCTCGGAGGTACCCAGGCCCCACGGCCGAGC 
CTCTGGCCGGAGATCGGCCGCCCACGGGGGGCCACAGCAGCGGCCGCTCGCCCAGGATGGAGAGGCGGGTCCCAGGC 
CCGGCCCGGAGCGAGTCCCCCAGGGCCTGTCGACACGGCGGGGCCCGGTGGCCGGCATCTGGCCCGCACGTGTCCGA 
GGGGCCCCCGGGTCCCCGGCACCATGGCTACTACCGGGGCTCCGACTACGACGAGGCCGATGGCCCGGGCAGCGGGG 
GCGGCGAGGAGGCCATGGCCGGGGCCTACGACGCGCCACCCCCCGTACGACACGCGTCCTCGGGCGCCACCGGGCGC 
TCGCCCAGGACTCCCCGGGCCTCGGGCCCGGCCTGCGCCTCGCCTTCTCGGCACGGCCGGCGACTCCCCAACGGCTA 
CTACCCGGCGCACGGACTGGCCAGGCCCCGCGGGCCGGGCTCCAGGAAGGGCCTGCACGAACCCTACAGCGAGAGTG 
ACGATGATTGGTGCTAAGCCCGGGCGAGGTGGCGCCCGCCCGGCCCCCCACGCACCCCACGCACACACCCCACCCGA 
GGAGCCGCGCAGAGGCCGCGGGGGCCCAGCACAGAGGGCCCGGGAGAGGGCCAGCCGGGAGACCCCAGACTCTGGAG 
AGGCCAGGGCTGGGCCACAAGGGTGTCCCGCAGAGACCCTCGGCCAAAAGAGACCCTCCTGGGCAGCCACGGCGCCC 
CCCAACCAGCCCCGATCCCCCCACCCACGACAGGGGCTCTCGGGTGGGAGGCAGGGAGCAGACAAACCACACAGCCA 
AGGGAT T T GAAT T AACTC AGCC AT TTTT GGAGAAC TTT GGGGAACAT G AAAAAAAAAAAAAAAAAAAAAAAAAC AT T 
TTTAAAAGAAAAAACGGGGAGAAAAAAATAGCTTCTATTGATGAGTTTTATCATCTCAATTGAATCTTTCCTTTCCC 
TGATGAAGACAGCTGGTGGCCGAGTGCGGCAAAGAAGCCAGAAGGAACCAGAATCCCAGTGCCCTACACCCACCACC 
AGACACACTCACACCCACACACGTTCTCAGACACACACAAGAGTGCTTGCCGGTTATACCAAACCCTACTATTACTG 
CCTGCAGAAATCAATTTAAAAAAATAATAATAACAATAAACAATTTTAAAAAGGACAAAAAAATTAATGATTGAGAA 
AAGAGGCATTTTTTTCTGACATTTGGTCCTGCTTGAAACAACAAAAGAAGAAGAAAAACCCACCATCACCACCGATT 
CCTTTGCTTCTTTTTTCCTTTTTTCCTACCTTGTTTGAAAACCGTGGGCTTGGGACTGTGAA 
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CTGCCCCTGGGTCTCTGCGCCTTTGCATGAGACTTTACGGTAAGCCGCTCCTCCCGCGCCCCCGCCCCCAGCCCCGC 
TCGGCGATCCCCGGCGCCGTCGCCAGGCGCTGGCCGTGGTGCTGATTCTGTCAGGCGCTGGCGGCGGCAGCGGCGGT 
GACGGCTGCGGCCCCGCTCCCTCTACCCGGCCGGACCCGGCTCTGCCCCCGCGCCCAAGCCCCACCAAGCCCCCCGC 
CCTCCCGCCGCGGTCCCAGCCCAGGGCGCGGCCGCAACCAGCACC ATG CGCCCGGTAGCCCTGCTGCTCCTGCCCTC 
GCTGCTGGCGCTCCTGGCTCACGGACTCTCTTTAGAGGCCCCAACCGTGGGGAAAGGACAAGCCCCAGGCATCGAGG 
AGACAGATGGCGAGCTGACAGCAGCCCCCACACCTGAGCAGCCAGAACGAGGCGTCCACTTTGTCACAACAGCCCCC 
ACCTTGAAGCTGCTCAACCACCACCCGCTGCTTGAGGAATTCCTACACGAGGGGCTGGAAAAGGGAGATGAGGAGCT 
GAGGCCAGCACTGTCCTTTCAGCCTGACCCACCTGCACCCTTCACCCCAAGTCCCCTTCCCCGCCTGGCCAACCAGG 
ACAGCCGCCCTGTCTTTACCAGCCCCACTCCAGCCATGGCTGCGGTACCCACTCAGCCCCAGTCCAAGGAGGGACCC 
TGGAGTCCGGATCCGGAGTCAGAGTCCCCTATGCTTCGAATCACAGCTCCCCTACCTCCAGGGCCCAGCATGGCAGT 
GCCCACCCTAGGCCCAGGGGAGATAGCCAGCACTACACCCCCCAGCAGAGCCTGGACACCAACCCAAGAGGGTCCTG 
GAGACATGGGAAGGCCGTGGGTTGCAGAGGTTGTGTCCCAGGGCGCAGGGATCGGGATCCAGGGGACCATCACCTCC 
TCCACAGCTTCAGGAGATGATGAGGAGACCACCACTACCACCACCATCATCACCACCACCATCACCACAGTCCAGAC 
ACCAGGCCCTTGTAGCTGGAATTTCTCAGGCCCAGAGGGCTCTCTGGACTCCCCTACAGACCTCAGCTCCCCCACTG 
ATGTTGGCCTGGACTGCTTCTTCTACATCTCTGTCTACCCTGGCTATGGCGTGGAAATCAAGGTCAAGAATATCAGC 
CTCCGGGAAGGGGAGACAGTGACTGTGGAAGGCCTGGGGGGGCCTGACCCACTGCCCCTGGCCAACCAGTCTTTCCT 
GCTGCGGGGCCAAGTCATCCGCAGCCCCACCCACCAAGCGGCCCTGAGGTTCCAGAGCCTCCCGCCACCGGCTGGCC 
CTGGCACCTTCCATTTCCATTACCAAGCCTATCTCCTGAGCTGCCACTTTCCCCGTCGTCCAGCTTATGGAGATGTG 
ACTGTCACCAGCCTCCACCCAGGGGGTAGTGCCCGCTTCCATTGTGCCACTGGCTACCAGCTGAAGGGCGCCAGGCA 
TCTCACCTGTCTCAATGCCACCCAGCCCTTCTGGGATTCAAAGGAGCCCGTCTGCATCGGTGAGTGCCCAGGGGTGA 
TCCGCAATGCCACCACCGGCCGCATCGTCTCTCCAGGCTTCCCGGGCAACTACAGCAACAACCTCACCTGTCACTGG 
CTGCTTGAGGCTCCTGAGGGCCAGCGGCTACACCTGCACTTTGAGAAGGTTTCCCTGGCAGAGGATGATGACAGGCT 
CATCATTCGCAATGGGGACAACGTGGAGGCCCCACCAGTGTATGATTCCTATGAGGTGGAATACCTGCCCATTGAGG 
GCCTGCTCAGCTCTGGCAAACACTTCTTTGTTGAGCTCAGTACTGACAGCAGCGGGGCAGCTGCAGGCATGGCCCTG 
CGCTATGAGGCCTTCCAGCAGGGCCATTGCTATGAGCCCTTTGTCAAATACGGTAACTTCAGCAGCAGCACACCCAC 
CTACCCTGTGGGTACCACTGTGGAGTTCAGCTGCGACCCTGGCTACACCCTGGAGCAGGGCTCCATCATCATCGAGT 
GTGTTGACCCCCACGACCCCCAGTGGAATGAGACAGAGCCAGCCTGCCGAGCCGTGTGCAGCGGGGAGATCACAGAC 
TCGGCTGGCGTGGTACTCTCTCCCAACTGGCCAGAGCCCTACGGTCGTGGGCAGGATTGTATCTGGGGTGTGCATGT 
GGAAGAGGACAAGCGCATCATGCTGGACATCCGAGTGCTGCGCATAGGCCCTGGTGATGTGCTTACCTTCTATGATG 
GGGATGACCTGACGGCCCGGGTTCTGGGCCAGTACTCAGGGCCCCGTAGCCACTTCAAGCTCTTTACCTCCATGGCT 
GATGTCACCATTCAGTTCCAGTCGGACCCCGGGACCTCAGTGCTGGGCTACCAGCAGGGCTTCGTCATCCACTTCTT 
TGAGGTGCCCCGCAATGACACATGTCCGGAGCTGCCTGAGATCCCCAATGGCTGGAAGAGCCCATCGCAGCCTGAGC 
TAGTGCACGGCACCGTGGTCACTTACCAGTGCTACCCTGGCTACCAGGTAGTGGGATCCAGTGTCCTCATGTGCCAG 
TGGGACCTAACTTGGAGTGAGGACCTGCCCTCATGCCAGAGGGTGACTTCCTGCCACGATCCTGGAGATGTGGAGCA 
CAGCCGACGCCTCATATCCAGCCCCAAGTTTCCCGTGGGGGCCACCGTGCAATATATCTGTGACCAGGGTTTTGTGC 
TGATGGGCAGCTCCATCCTCACCTGCCATGATCGCCAGGCTGGCAGCCCCAAGTGGAGTGACCGGGCCCCTAAATGT 
CTCCTGGAACAGCTCAAGCCATGCCATGGTCTCAGTGCCCCTGAGAATGGTGCCCGAAGTCCTGAGAAGCAGCTACA 
CCCAGCAGGGGCCACCATCCACTTCTCGTGTGCCCCTGGCTATGTGCTGAAGGGCCAGGCCAGCATCAAGTGTGTGC 
CTGGGCACCCCTCGCATTGGAGTGACCCCCCACCCATCTGTAGGGCTGCCTCTCTGGATGGTTCTACAACAGTCGCA 
GCCTGGATGGTTGCCAAGGCACCTGCTGCCTCCAGCACCCTGGATGCTGCCCACATTGCAGCTGCCATCTTCTTGCC 
ACTGGTGGCGATGGTGTTGTTGGTAGGAGGTGTATACTTCTACTTCTCCAGGCTCCAGGGAAAAAGCTCCCTGCAGC 
TGCCCCGCCCCCGCCCCCGCCCCTACAACCGCATTACCATAGAGTCAGCGTTTGACAATCCAACTTACGAGACTGGA 
TCTCTTTCCTTTGCAGGAGACGAGAGAATATGAAGTCTCCATCTAGGTGGGGGCAGTCTAGGGAAGTCAACTCAGAC 
TTGCACCACAGTCCAGCAGCAAGGCTCCTTGCTTCCTGCTGTCCCTCCACCTCCTGTATATACCACCTAGGAGGAGA 
TGCCACCAAGCCCTCAAGAAGTTGTGCCCTTCCCCGCCTGCGATGCCCACCATGGCCTATTTTCTTGGTGTCATTGC 
CCACTTGGGGCCCTTCATTGGGCCCATGTCAGGGGGCATCTACCTGTGGGAAGAACATAGCTGGAGCACAAGCATCA 
ACAGCCGGCATCCTGAGCCTCCTCATGCCCTGGACCAGCCTGGAACACACTAGCAGAGCAGGAGTACCTTTCTCCAC 
ATGACCACCATCCCGCCCTGGCATGGCAACCTGCAGCAGGATTAACTTGACCATGGTGGGAACTGCACCAGGGTACT 
CCTCACAGCGCCATCACCAATGGCCAAAACTCCTCTCAACGGTGACCTCTGGGTAGTCCTGGCATGCCAACATCAGC 
CTCTTGGGAGGTCTCTAGTTCTCTAAAGTTCTGGACAGTTCTGCCTCCTGCCCTGTCCCAGTGGAGGCAGTAATTCT 
AGGAGATCCTAAGGGGTTCAGGGGGACCCTACCCCCACCTCAGGTTGGGCTTCCCTGGGCACTCATGCTCCACACCA 
AAGCAGGACACGCCATTTTCCACTGACCACCCTATACCCTGAGGAAAGGGAGACTTTCCTCCGATGTTT 



82/170 



FIGURE 74B 

ATTTAGCTGTTGCAAACATCTTCACCCTAATAGTCCCTCCTCCAATTCCAGCCACTTGTCAGGCTCTCCTCTTGACC 
ACTGTGTTATGGGATAAGGGGAGGGGGTGGGCATATTCTGGAGAGGAGCAGAGGTCCAAGGACCCAGGAATTTGGCA 
TGGAACAGGTGGTAGGAGAGCCCCAGGGAGACGCCCAGGAGCTGGCTGAAAGCCACTTTGTACATGTAATGTATTAT 
ATGGGGTCTGGGCTCCAGCCAGAGAACAATCTTTTATTTCTGTTGTTTCCTTATTAAAATGGTGTTTTTGGAAAAAAAA 
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GCTGAGCCTGAGCCCGACCCGGGGCGCCTCCCGCCAGGCACCATGGTGCAGAAGTCGCGCAACGGCGGCGTATACCC 
CGGCCCGAGCGGGGAGAAGAAGCTGAAGGTGGGCTTCGTGGGGCTGGACCCCGGCGCGCCCGACTCCACCCGGGACG 
GGGCGCTGCTGATCGCCGGCTCCGAGGCCCCCAAGCGCGGCAGCATCCTCAGCAAACCTCGCGCGGGCGGCGCGGGC 
GCCGGGAAGCCCCCCAAGCGCAACGCCTTCTACCGCAAGCTGCAGAATTTCCTCTACAACGTGCTGGAGCGGCCGCG 
CGGCTGGGCGTTCATCTACCACGCCTACGTGTTCCTCCTGGTTTTCTCCTGCCTCGTGCTGTCTGTGTTTTCCACCA 
TCAAGGAGTATGAGAAGAGCTCGGAGGGGGCCCTCTACATCCTGGAAATCGTGACTATCGTGGTGTTTGGCGTGGAG 
TACTTCGTGCGGATCTGGGCCGCAGGCTGCTGCTGCCGGTACCGTGGCTGGAGGGGGCGGCTCAAGTTTGCCCGGAA 
ACCGTTCTGTGTGATTGACATCATGGTGCTCATCGCCTCCATTGCGGTGCTGGCCGCCGGCTCCCAGGGCAACGTCT 
TTGCCACATCTGCGCTCCGGAGCCTGCGCTTCCTGCAGATTCTGCGGATGATCCGCATGGACCGGCGGGGAGGCACC 
TGGAAGCTGCTGGGCTCTGTGGTCTATGCCCACAGCAAGGAGCTGGTCACTGCCTGGTACATCGGCTTCCTTTGTCT 
CATCCTGGCCTCGTTCCTGGTGTACTTGGCAGAGAAGGGGGAGAACGACCACTTTGACACCTACGCGGATGCACTCT 
GGTGGGGCCTGATCACGCTGACCACCATTGGCTACGGGGACAAGTACCCCCAGACCTGGAACGGCAGGCTCCTTGCG 
GCAACCTTCACCCTCATCGGTGTCTCCTTCTTCGCGCTGCCTGCAGGCATCTTGGGGTCTGGGTTTGCCCTGAAGGT 
TCAGGAGCAGCACAGGCAGAAGCACTTTGAGAAGAGGCGGAACCCGGCAGCAGGCCTGATCCAGTCGGCCTGGAGAT 
TCTACGCCACCAACCTCTCGCGCACAGACCTGCACTCCACGTGGCAGTACTACGAGCGAACGGTCACCGTGCCCATG 
TACAGACTTATCCCCCCGCTGAACCAGCTGGAGCTGCTGAGGAACCTCAAGAGTAAATCTGGACTCGCTTTCAGGAA 
GGACCCCCCGCCGGAGCCGTCTCCAAGCCAGAAGGTCAGTTTGAAAGATCGTGTCTTCTCCAGCCCCCGAGGCGTGG 
CTGCCAAGGGGAAGGGGTCCCCGCAGGCCCAGACTGTGAGGCGGTCACCCAGCGCCGACCAGAGCCTCGAGGACAGC 
CCCAGCAAGGTGCCCAAGAGCTGGAGCTTCGGGGACCGCAGCCGGGCACGCCAGGCTTTCCGCATCAAGGGTGCCGC 
GTCACGGCAGAACTCAGAAGAAGCAAGCCTCCCCGGAGAGGACATTGTGGATGACAAGAGCTGCCCCTGCGAGTTTG 
TGACCGAGGACCTGACCCCGGGCCTCAAAGTCAGCATCAGAGCCGTGTGTGTCATGCGGTTCCTGGTGTCCAAGCGG 
AAGTTCAAGGAGAGCCTGCGGCCCTACGACGTGATGGACGTCATCGAGCAGTACTCAGCCGGCCACCTGGACATGCT 
GTCCCGAATTAAGAGCCTGCAGTCCAGAGTGGACCAGATCGTGGGGCGGGGCCCAGCGATCACGGACAAGGACCGCA 
CCAAGGGCCCGGCCGAGGCGGAGCTGCCCGAGGACCCCAGCATGATGGGACGGCTCGGGAAGGTGGAGAAGCAGGTC 
TTGTCCATGGAGAAGAAGCTGGACTTCCTGGTGAATATCTACATGCAGCGGATGGGCATCCCCCCGACAGAGACCGA 
GGCCTACTTTGGGGCCAAAGAGCCGGAGCCGGCGCCGCCGTACCACAGCCCGGAAGACAGCCGGGAGCATGTCGACA 
GGCACGGCTGCATTGTCAAGATCGTGCGCTCCAGCAGCTCCACGGGCCAGAAGAACTTCTCGGCGCCCCCGGCCGCG 
CCCCCTGTCCAGTGTCCGCCCTCCACCTCCTGGCAGCCACAGAGCCACCCGCGCCAGGGCCACGGCACCTCCCCCGT 
GGGGGACCACGGCTCCCTGGTGCGCATCCCGCCGCCGCCTGCCCACGAGCGGTCGCTGTCCGCCTACGGCGGGGGCA 
ACCGCGCCAGCATGGAGTTCCTGCGGCAGGAGGACACCCCGGGCTGCAGGCCCCCCGAGGGGACCCTGCGGGACAGC 
GACACGTCCATCTCCATCCCGTCCGTGGACCACGAGGAGCTGGAGCGTTCCTTCAGCGGCTTCAGCATCTCCCAGTC 
CAAGGAGAACCTGGATGCTCTCAACAGCTGCTACGCGGCCGTGGCGCCTTGTGCCAAAGTCAGGCCCTACATTGCGG 
AGGGAGAGTCAGACACCGACTCCGACCTCTGTACCCCGTGCGGGCCCCCGCCACGCTCGGCCACCGGCGAGGGTCCC 
TTTGGTGACGTGGGCTGGGCCGGGCCCAGGAAGTGAGGCGGCGCTGGGCCAGTGGACCCGCCCGCGGCCCTCCTCAG 
CACGGTGCCTCCGAGGTTTTGAGGCGGGAACCCTCTGGGGCCCTTTTCTTACAGTAACTGAGTGTGGCGGGAAGGGT 
GGGCCCTGGAGGGGCCCATGTGGGCTGAAGGATGGGGGCTCCTGGCAGTGACCTTTTACAAAAGTTATTTTCCAACA 
GGGCACTCCCAGGCCCTGTCGCCATTGAGGTGCCTCCGCTGGGCTGTCTCCTCACCCCTCCCTGTGCTGGAGCCTGT 
CCCAAAAAGGTGCCAACTGGGAGGCCTCGGAAGCCACTGTCCAGGCTCCCACTGCCTGTCTGCTCTGTTCCCAAAGG 
CAGCGTGTGTGGCCTCGGGCCCTGCGGTGGCATGAAGCATCCCTTCTGGTGTGGGCATCGCTACGTGTTTTGGGGGC 
AGCGTTTCACGGCGGTGCCCTTGCTGTCTCCCTTGGGCTGGCTCGAGCCTGGGGTCCATGTCCCTTTGCCGTCCCGT 
CATGGGGCAGGGAATCCATAGCGGGGCCCACAGGCAGGGGTATGAGTGCGTCCCACCCAACGCAGCACCAGCCCCGG 
CCACCGCTCCCCGTGTCCCCAGTTCCGTCTCAGCTACCTGGACTCCAGGACCCTGGAGAAGGGAGACCTGGCAGTGG 
AGGGAGGCTGTGCTGTGTGTCCCCCTGCAGGTGTGACCCCGCCTGCTCTTTCCTCCCCCGCCAGGTGTGGCCCCGCC 
TGCTCTTTCCTCCCCCACCAGTATGGCCCCACCTGCTCTTTCCTXCCCCCCCAAGGTGTGGCCCCACCTGTTCTTTC 
CTCCCCTGCCGAGGTGTGACCCCACCTGCTCTTTCCTCCCTCCCAGTATGGCCCCACCTGCTCTTTCCTCCCCCGAG 
GTGAGGCCCCGCCTGCTCTTTCCTCCCATGGGAGCCGCTGAGGCGTGCGCACCTGGGCACAGGTTGGGGCTCTGCAG 
GATGAGGAAGACAGGCCAATCCCTTCCCTCCCAGAAGCTGGCCGCCCAGCAGGAGGGACTGAGGCCAGACTCATGTC 
CAGCAAGGAACGTGTGGTGTGTCCCCTGGGAAGTCTCTGGGCCCTGGGAAGAGGGAAGGTGCACGTCCTGGGATGGT 
TGCGGGGCCCTGTTTTGGGAGACAAAGGGGTAGAGGGTCTGTCTTGXGCCCCCCCAGACTCTAGCCCGAGCAGTGCA 
GCCACCTACTGCCCCACCTCAGAGAAGTGCAGCGGGAAGGAGGCTGGAGGTGGTGCGGCGCTGCCTCGGGTGTCTGC 
GTGAATGAGCGTGGCCAAGGACCAGTGCCACCTCATGGCAAAGAGCTCCCGCAGTGTTTGTTAGAGTGCACATCCTA 
CGTGCCCACTGGCACACACACGTGCTCACATACAXGTCXGXGTACAGGXGTACACATGCAXGCTTGCAC 



84/170 

FIGURE 75B 

ACATGCACACAGACCACATAGCACACATGTGCACTGACCACACCTGTATAGACCATGCACAGTACACATACGTGCAT 
ACACATGCCTGCATACAGGCATACACATGCACGCTTACATGTACACGTGCACAGATCACACACATGCACACACGTGT 
AGCTCACACACAGTATACACATACACAAGTGCACAGACCACACACAGCACTAACACATGCACACACAAAGTGCATAG 
GCCACACAGCACATGCACACAGGTGCACAGACCACACAGCACACACAAGTGCACAGAGCACACTGCACACATGCACA 
CACACACGCGTGCATGCACACTCCTCGCACTTCCAGCCTTGGAGCCCTTCTGTCTCTGGTCTTTCTCTTTGACCCTG 
CTGAGTGTAAGCTGCCTGGGGAGGGGCTACAAGGAGTAATTGTGGCTTTAGGGGTCGTGGTGATGCTGGAATGTCAA 
GCGCCGTCGTGGGGTATCCGACTGTCCGGGCTCCTGGTCCGCAGTGGCAGAGCGCCAGGCAGAGCCAATCAGGGTCT 
CGTGCTGCCCTTCCCCCCCACAGCCTGGCAGCCATCCAGAGGAGGGGCTCTACCAGATGCCAAGGTGCCCCGGTGTC 
TGTATGGGTGTCCGGTTGGGTCCTGTGTTTGGTCTGCCCTGGAGGTGGCTGGGCCCTCCTGGGATGGGTGGCTCAGC 
CTCGAATCCCAGGCCCCAGCCCAGGCAGGTGCTGCTGCCTGTTGTGGTTTCCTGGCCCAGCTTCTCCTTCTCCCTCT 
GCATAAAATCACAGTCCGTGAGTCTTCCAGCTGCCACCACGGCTGGGACACGCTGGGGGAGGGCTCCTCCCATGCCT 
CCTGCACACAGCCGTCTGAGCAGGGCAGGTGCCAACACCCCCCACCGGAGACACGCTGCCCCTCAGCGATGCCCCTA 
CCTTTTGGGGGGCCTCGTCTCAAGCCCCCCCTTGGAGGCTGAAATCACCCCAGGCACTGTGAGGGCTTCTCCAGGGG 
GACACCCTTTGAGCTGTGGGTCTGATCACCCCAAGTCCCGCACACGGAGGAGAGGCACAGCCAGGGCGTGTGGTTTA 
ATGTTTGCCCCTTCGGGGCTGGAGGTCTCAGTGTTTCTAGATTCCAGACCCTGCTGCCAGAGAGACCTGCTGCCGGA 
GAGAAGGGGAGGAGGACTCCAGCTGGGCTCGGTCCCCCACAGTCAGGGACCCCCATAAAGGACACCCCCTTCTCTCT 
AGAAAGAGCTGGGCTCTCAGCTATTTCTAGTTGCTTCCCAGAAGCCGAGGAGCAGAAGGAGCTGTGAGAGCTTTGCA 
GAAACGCCCTTGTCCCCGCCCTCCTGAGCTATGAATGCCGTACAGAGCAGAGGCTGGGGCATTGGCAAGATCACAGG 
TTGATGCTGCACAGCCCCATTGACACAAACCCTCAAAGCAGACGTGAGAGGGACGGTTCACAAAGCTTGGACCTGCC 
GTGGAGGGTGCCCGGCAGACGTGGCGTGAGAGGGACGGCTCACGAGGCTTGGACCTGCTGTGGAGGGTGCCCAGCAG 
ACGTGGTGTGAGAGGAACGGCTCACGAGACTTGGACCTGGTGGAGGGTGCCCAGCAGACGTGGTGTGAGAGGGACGG 
CTCACAGGGCTTGGACCGGAGAGAGATGGCTCATGAGACTTGGACCTGCCGTGGAGGGTGCCCAGCAGACGTGGTAT 
GAGAGGGATGGCTCACGAGGCTTGGACCTGGTGGAGGGTGCCCGGCAGACGTGTGAGAGGGACGGTTCACAAGGCTT 
GGACCTGCCATGGAGGGTGCCCAGCAGACGTGGTGTGAGAGGGACAGCTCACGAGGCTTGGACCTGCCGTGGAGGGT 
GCCCAGCAGGGGGCTGAGCTCTGAGGGGTGGGTGCTCAGTGCACGGGTGCCCCCAGTGTCCTCTGATCCTGTCCGGT 
GCCTCCCCCAACCCCCACACCCATGCAGAACTCCCAGGTCACATGCACGTATGTCCAGGGCATGGGGGTGGCGTGAA 
GAGGCCTGGTCAGGGCCTTTAGGGGCTGCAGGACGGAATGGCCACCTGGGGAGCCTGTGTGGCTGTGCCGGGCAGCC 
ATCCTGCATTCCCACCCAGCGCGCAGTCTCCACCTCGGCCCCAGCAAAGCGCTAAGCAGCCGGAGAGACAGCCAGGG 
CGGCTTCCTGAAGGATGTGGGATGGTGGACTCCGGGGTCGAGGGAATACGCAGGTTCCTGTCCTCCGGGAGACCTAG 
AGAAGCTGCACACCCAGGAGCTTTCCATGACCCGGGAGCATGAGTGAATGGGGGGTTCCAGTTTGCTGAACTTTGCT 
GTCTTGTAAGGGTGGGGGCTGACGGCCGACCCTGGGAGGAGGTGACACCGCAGGGGGAGGTTGTGGGCAACGGTGGA 
GGAGGAGAGACGGGAGGGGACCATTTGGGATGGAGGGGCCTCTTCAGAGTTTTAAAAGGCGTTTGTGGGGTGGAGTT 
GAGTGTGCTCTGGGCTTGGACACTTGCCGTGGTGCCCCTGGCTGGCCGAGGAGACTGGCTCTGGCCAGGGCCCCGTC 
CTGAGAGGTCCTCAGCGTCTGACTCTCGGCCAGGCGCCAGCAAGGAGGGGCCGGTCCCCGGGGCTACCAGGCAGGCA 
CGTGCACATCGCCATCGCCACACGCCAACTCCGCCTGGGTTTTACAAAGTCGTTGCCTTAATGCATGTGGACAGGAA 
CTCCCTGAGGTCGCCCCATGCCCCCTGGCTGTGCCAGGTACGGACGCCCTGGACCCTGCGAACAGGTGGGGCGGGCG 
AGGGGCCCAAGGGACGGGCTCCAGAGACACGCGCAGGGCAGGAGGGGTCTCACGGAGGGGTCTCGCACTGAGGCGCC 
CAGAGCTGGTGGTCCCGCTGGACGCCATCCCTCTGCCCGGGATCCACACGGCCCACGTGTGCCCGCCATGCCCGCGC 
CCCACGCCATTGCAGTCTTCCATCCTCTGGCCGTGACGGTGGCTGCAGCTTCCCCATTTGCGCCGTTGCCTCTGGCT 
GTCTGCACTTTTGTTCATGCTCCAAAGAACATTTCATAATGCCTTCAGTACCGACGTACACTTCTGACCATTTTGTA 
TGTGTCCTTGTGCCGTAGTGACCAGGCCTTTTTTTGGTGGATGTGTTACCCCGCACACTTCAATCTCAACTTTGTGC 
ACCGTCCATTTTCTAGGGATAGACGCCCAGGGAATGAACTCTAGTTTTCTAACAGATTAGCTGAGATATTAACTTAC 
TCACACGGACAGGTTGATGCCAGAGCCGTAAGAATGCGCCAGTGCGGGTTTGCGGGGGACTTCGGGTGTGGGGTCCT 
GCGGCCGCGATGGCCGTGGAAGGTTCTGGGGATCCCTGCTGCCACGGGGACGAGTTCGGACGCCAGGTGGACCTGTG 
CACTCAGTAAAACGCAGTGATTCAAAAAAAAAAAAA 



85/170 



FIGURE 76 

GGCACGAGGCAAGTCCGGATGAAGAATTAAGAGAAAAAAAGTGAATATGGTTTTTGCTCACAGAATGGATAACAGCA 
AGCCACATTTGATTATTCCTACACTTCTGGTGCCCCTCCAAAACCGCAGCTGCACTGAAACAGCCACACCTCTGCCA 
AGCCAATACCTGATGGAATTAAGTGAGGAGCACAGTTGGATGAGCAACCAAACAGACCTTCACTATGTGCTGAAACC 
CGGGGAAGTGGCCACAGCCAGCATCTTCTTTGGGATTCTGTGGTTGTTTTCTATCTTCGGCAATTCCCTGGTTTGTT 
TGGTCATCCATAGGAGTAGGAGGACTCAGTCTACCACCAACTACTTTGTGGTCTCCATGGCATGTGCTGACCTTCTC 
ATCAGCGTTGCCAGCACGCCTTTCGTCCTGCTCCAGTTCACCACTGGAAGGTGGACGCTGGGTAGTGCAACGTGCAA 
GGTTGTGCGATATTTTCAATATCTCACTCCAGGTGTCCAGATCTACGTTCTCCTCTCCATCTGCATAGACCGGTTCT 
ACACCATCGTCTATCCTCTGAGCTTCAAGGTGTCCAGAGAAAAAGCCAAGAAAATGATTGCGGCATCGTGGATCTTT 
GATGCAGGCTTTGTGACCCCTGTGCTCTTTTTCTATGGCTCCAACTGGGACAGTCATTGTAACTATTTCCTCCCCTC 
CTCTTGGGAAGGCACTGCCTACACTGTCATCCACTTCTTGGTGGGCTTTGTGATTCCATCTGTCCTCATAATTTTAT 
TTTACCAAAAGGTCATAAAATATATTTGGAGAATAGGCACAGATGGCCGAACGGTGAGGAGGACAATGAACATTGTC 
CCTCGGACAAAAGTGAAAACTATCAAGATGTTCCTCATTTTAAATCTGTTGTTTTTGCTCTCCTGGCTGCCTTTTCA 
TGTAGCTCAGCTATGGCACCCCCATGAACAAGACTATAAGAAAAGTTCCCTTGTTTTCACAGCTATCACATGGATAT 
CCTTTAGTTCTTCAGCCTCTAAACCTACTCTGTATTCAATTTATAATGCCAATTTTCGGAGAGGGATGAAAGAGACT 
TTTTGCATGTCCTCTATGAAATGTTACCGAAGCAATGCCTATACTATTACAACAAGTTCAAGGATGGCCAAAAAAAA 
CTACGTTGGCATTTCAGAAATCCCTTCCATGGCCAAAACTATTACCAAAGACTCGATCTATGACTCATTTGACAGAG 
AAGCCAAGGAAAAAAAGCTTGCTTGGCCCATTAACTCAAATCCACCAAATACTTTTGTC TAA GTTCTCATTCTTTCA 
ATTGTTATGCACCAGAGATTAAAAAGCTTTAACTATAAAAACAGAAGCTATTTACATATTTGTTTTCACTCAACTTT 
CCAAGGGAAATGTTTTATTTTGTAAAATGCATTCATTTGTTTACTGT 
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GCCATGGCCGTCCGGCCCGGCCTGTGGCCAGCGCTCCTGGGCATAGTCCTCGCCGCTTGGCTCCGCGGCTCGGGTGC 
CCAGCAGAGTGCCACCGTGGCCAACCCAGTGCCTGGTGCCAACCCGGACCTGCTTCCCCACTTCCTGGTGGAGCCCG 
AGGATGTGTACATCGTCAAGAACAAGCCAGTGCTGCTTGTGTGCAAGGCCGTGCCCGCCACGCAGATCTTCTTCAAG 
TGCAACGGGGAGTGGGTGCGCCAGGTGGACCACGTGATCGAGCGCAGCACAGACGGGAGCAGTGGGCTGCCCACCAT 
GGAGGTCCGCATTAATGTCTCAAGGCAGCAGGTCGAGAAGGTGTTCGGGCTGGAGGAATACTGGTGCCAGTGCGTGG 
CATGGAGCTCCTCGGGCACCACCAAGAGTCAGAAGGCCTACATCCGCATAGCCTATTTGCGCAAGAACTTCGAGCAG 
GAGCCGCTGGCCAAGGAGGTGTCCCTGGAGCAGGGCATCGTGCTGCCCTGCCGTCCACCGGAGGGCATCCCTCCAGC 
CGAGGTGGAGTGGCTCCGGAACGAGGACCTGGTGGACCCGTCCCTGGACCCCAATGTATACATCACGCGGGAGCACA 
GCCTGGTGGTGCGACAGGCCCGCCTTGCTGACACGGCCAACTACACCTGCGTGGCCAAGAACATCGTGGCACGTCGC 
CGCAGCGCCTCCGCTGCTGTCATCGTCTACGTGGACGGCAGCTGGAGCCCGTGGAGCAAGTGGTCGGCCTGTGGGCT 
GGACTGCACCCACTGGCGGAGCCGTGAGTGCTCTGACCCAGCACCCCGCAACGGAGGGGAGGAGTGCCAGGGCACTG 
ACCTGGACACCCGCAACTGTACCAGTGACCTCTGTGTACACACTGCTTCTGGCCCTGAGGACGTGGCCCTCTATGTG 
GGCCTCATCGCCGTGGCCGTCTGCCTGGTCCTGCTGCTGCTTGTCCTCATCCTCGTTTATTGCCGGAAGAAGGAGGG 
GCTGGACTCAGATGTGGCTGACTCGTCCATTCTCACCTCAGGCTTCCAGCCCGTCAGCATCAAGCCCAGCAAAGCAG 
ACAACCCCCATCTGCTCACCATCCAGCCGGACCTCAGCACCACCACCACCACCTACCAGGGCAGTCTCTGTCCCCGG 
CAGGATGGGCCCAGCCCCAAGTTCCAGCTCACCAATGGGCACCTGCTCAGCCCCCTGGGTGGCGGCCGCCACACACT 
GCACCACAGCTCTCCCACCTCTGAGGCCGAGGAGTTCGTCTCCCGCCTCTCCACCCAGAACTACTTCCGCTCCCTGC 
CCCGAGGCACCAGCAACATGACCTATGGGACCTTCAACTTCCTCGGGGGCCGGCTGATGATCCCTAATACAGGAATC 
AGCCTCCTCATCCCCCCAGATGCCATACCCCGAGGGAAGATCTATGAGATCTACCTCACGCTGCACAAGCCGGAAGA 
CGTGAGGTTGCCCCTAGCTGGCTGTCAGACCCTGCTGAGTCCCATCGTTAGCTGTGGACCCCCTGGCGTCCTGCTCA 
CCCGGCCAGTCATCCTGGCTATGGACCACTGTGGGGAGCCCAGCCCTGACAGCTGGAGCCTGCGCCTCAAAAAGCAG 
TCGTGCGAGGGCAGCTGGGAGGATGTGCTGCACCTGGGCGAGGAGGCGCCCTCCCACCTCTACTACTGCCAGCTGGA 
GGCCAGTGCCTGCTACGTCTTCACCGAGCAGCTGGGCCGCTTTGCCCTGGTGGGAGAGGCCCTCAGCGTGGCTGCCG 
CCAAGCGCCTCAAGCTGCTTCTGTTTGCGCCGGTGGCCTGCACCTCCCTCGAGTACAACATCCGGGTCTACTGCCTG 
CATGACACCCACGATGCACTCAAGGAGGTGGTGCAGCTGGAGAAGCAGCTGGGGGGACAGCTGATCCAGGAGCCACG 
GGTCCTGCACTTCAAGGACAGTTACCACAACCTGCGCCTATCCATCCACGATGTGCCCAGCTCCCTGTGGAAGAGTA 
AGCTCCTTGTCAGCTACCAGGAGATCCCCTTTTATCACATCTGGAATGGCACGCAGCGGTACTTGCACTGCACCTTC 
ACCCTGGAGCGTGTCAGCCCCAGCACTAGTGACCTGGCCTGCAAGCTGTGGGTGTGGCAGGTGGAGGGCGACGGGCA 
GAGCTTCAGCATCAACTTCAACATCACCAAGGACACAAGGTTTGCTGAGCTGCTGGCTCTGGAGAGTGAAGCGGGGG 
TCCCAGCCCTGGTGGGCCCCAGTGCCTTCAAGATCCCCTTCCTCATTCGGCAGAAGATAATTTCCAGCCTGGACCCA 
CCCTGTAGGCGGGGTGCCGACTGGCGGACTCTGGCCCAGAAACTCCACCTGGACAGCCATCTCAGCTTCTTTGCCTC 
CAAGCCCAGCCCCACAGCCATGATCCTCAACCTGTGGGAGGCGCGGCACTTCCCCAACGGCAACCTCAGCCAGCTGG 
CTGCAGCAGTGGCTGGACTGGGCCAGCCAGACGCTGGCCTCTTCACAGTGTCGGAGGCTGAGTGCTGAGGCCGGCCA 
GGCCCGACACCTACACTCTCACCAGCTTTGGCACCCACCAAGGACAGGCAGAAGCCGGACAGGGGCCCTTCCCCACA 
CCGGGGAGAGCTGCTCGGACAGGCCCCCTCCCGGCCGAAGCTGTCCCTTAATGCTGGTCCTTCAGACCCTGCCCGAA 
CTCCCACCTCTCCATGGCCTGCCTAGCCAGGCTGGCACTGCCACTCACACTCGGCCCCAGGGCCCAGGAGGGACAGT 
GCCTGGAGCCTGGGCCAGGCCCAGCCCATCTGTGTGTGTGTATGTGCGTGTGATGCTACCTCTCCTCCCGTCCCTCT 
CCAGGGGCCCCGCATACACACGGCCATGCACGCACACACTGGGCCTGGGCCAGGGCCCCAGAGCTCCTGCCTGAGCT 
GGACCTTATGCAAACATTTCTGTGCCTGCTGGGTAGGGGCACGTCTGAGGGGCCCTGCTCCAAGCCTGCAGGACCGA 
GGGCCACAGCCGGACAGGGGGTAGCCCCTGGATTCAGGCACACGACCACCACACGAGCACGTGCCACGCATGCCTCG 
TGTGCTCATCTCACACACACCCCCCTCCCGGGTCACGCAGACACCCCCCAACCACACACATCTCATGCCGTACACCT 
GAGGCTGCTCACGTCTCACGCCCAGTGTTGGTGCACATTTGCCTCTCACATGCTGCCCTCTCCACCCACCCAGGGAC 
ACCCCACGGCTCCTCCCTGCCCCTGCCCCTCCCCCAGCCTTGAGGTGCCCTGCCCGGCGGGGCCTGTGAATATGCAA 
TGGGAGTCCCAGGCTGTACAGTGGTGAGTGTGTGTGTGGCGTGGCGTGCCCGTCCCCAGGGCTGGCTGGTGCCCCAC 
GCGGGGCCTGTCATGTGAAGCTCGTGTCCTGACTTTGTCTTAAGTGCATTCACGCACTTACTCTTGGCCTTATGTAC 
ACAGCCTTGCCCGGCCGCCGGGGCACATAGGGGTTTTATCGGGCGTGAATGTAAATAAATTATATATATATATTGCT 
AAAAAAAAAAAAAAAAAAA 
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TAAGGTAATTCGTATGCAAGAAGCTACACGTAATTAAATGTGCAGGATGAAAAGATGGCACAGGCACTGTTGGTACC 
CCCAGGACCTGAAAGCTTCCGCCTTTTTACTAGAGAATCTCTTGCTGCTATCGAAAAACGTGCTGCAGAAGAGAAAG 
CCAAGAAGCCCAAAAAGGAACAAGATAATGATGATGAGAACAAACCAAAGCCAAATAGTGACTTGGAAGCTGGAAAG 
AACCTTCCATTTATTTATGGAGACATTCCTCCAGAGATGGTGTCAGAGCCCCTGGAGGACCTGGATCCCTACTATAT 
CAATAAGAAAACTTTTATAGTAATGAATAAAGGAAAGGCAATTTTCCGATTCAGTGCCACCTCTGCCTTGTATATTT 
TAACTCCACTAAACCCTGTTAGGAAAATTGCTATCAAGATTTTGGTACATTCTTTATTCAGCATGCTTATCATGTGC 
ACTATTTTGACCAACTGTGTATTTATGACCTTGAGCAACCCTCCTGACTGGACAAAGAATGTAGAGTACACATTCAC 
TGGAATCTATACCTTTGAGTCACTTATAAAAATCTTGGTAAGAGGGTTTTGCTTAGAAGATTTTACGTTTCTTCGTG 
ATCCATGGAACTGGCTGGATTTCAGTGTCATTGTGATGGCGTATGTAACAGAATTTGTAAGCCTAGGCAATGTTTCA 
GCCCTTCGAACTTTCAGAGTCTTGAGAGCTCTGAAAACTATTTCTGTAATTCCAGGTTTAAAGACCATTGTGGGGGC 
CCTGATCCAGTCGGTAAAGAAGCTTTCTGATGTGATGATCCTGACTGTGTTCTGTCTGAGCGTGTTTGCTCTCATTG 
GGCTGCAGCTGTTCATGGGCAATCTGAGGAATAAATGTTTGCAGTGGCCCCCAAGCGATTCTGCTTTTGAAACCAAC 
ACCACTTCCTACTTTAATGGCACAATGGATTCAAATGGGACATTTGTTAATGTAACAATGAGCACATTTAACTGGAA 
GGATAACATTGGAGATGACAGTCACTTTTATGTTTTGGATGGGCAAAAAGACCCTTTACTCTGTGGAAATGGTTCAG 
ATGCAGGCCAGTGTCCAGAAGGATACATCTGTGTGAAGGCTGGTCGAAACCCCAACTATGGCTACACAAGCTTTGAC 
ACCTTTAGCTGGGCTTTCCTGTCTCTATTTCGACTCATGACTCAAGACTACTGGGAAAATCTTTACCAGTTGACATT 
ACGTGCTGCTGGGAAAACATACACGATATTTTTTGTCCTGGTCATTTTCTTGGGCTCATTTTATTTGGTGAATTTGA 
TCCTGGCTGTGGTGGCCATGGCCTATGAGGAGCAGAATCAGGCCACCTTGGAAGAAGCAGAACAAAAAGAGGCCGAA 
TTTCAGCAGATGCTCGAACAGCTTAAAAAGCAACAGGAAGAAGCTCAGGCAGTTGCGGCAGCATCAGCTGCTTCAAG 
AGATTTCAGTGGAATAGGTGGGTTAGGAGAGCTGTTGGAAAGTTCTTCAGAAGCATCAAAGTTGAGTTCCAAAAGTG 
CTAAAGAATGGAGGAACCGAAGGAAGAAAAGAAGACAGAGAGAGCACCTTGAAGGAAACAACAAAGGAGAGAGAGAC 
AGCTTTCCCAAATCCGAATCTGAAGACAGCGTCAAAAGAAGCAGCTTCCTTTTCTCCATGGATGGAAACAGACTGAC 
CAGTGACAAAAAATTCTGCTCCCCTCATCAGTCTCTCTTGAGTATCCGTGGCTCCCTGTTTTCCCCAAGACGCAATA 
GCAAAACAAGCATTTTCAGTTTCAGAGGTCGGGCAAAGGATGTTGGATCTGAAAATGACTTTGCTGATGATGAACAC 
AGCACATTTGAAGACAGCGAAACGAGGAGAGACTCACTGTTTGAGCCGCACAGACATGGAGAGCGACGCAACAGTAA 
CGGCACCACCACAGAAACGGAGGTCAGAAAGAGAAGGTTAAGTTCTTACCAGATTTCAATGGAGATGCTGGAGGATT 
CTTCTGGAAGGCAAAGAGCCGTGAGCATAGCCAGCATTCTGACCAACACAATGGAAGAACTTGAAGAATCTAGACAG 
AAATGTCCGCCATGCTGGTATAGATTTGCCAATGTGTTCTTGATCTGGGACTGCTGTGATGCATGGTTAAAAGTAAA 
ACATCTTGTGAATTTAATTGTTATGGATCCATTTGTTGATCTTGCCATCACTATTTGCATTGTCTTAAATACCCTCT 
TTATGGCCATGGAGCACTACCCCATGACTGAGCAATTCAGTAGTGTGTTGACTGTAGGAAACCTGGTCTTTACTGGG 
ATTTTCACAGCAGAAATGGTTCTCAAGATCATTGCCATGGATCCTTATTACTATTTCCAAGAAGGCTGGAATATCTT 
TGATGGAATTATTGTCAGCCTCAGTTTAATGGAGCTTGGTCTGTCAAATGTGGAGGGATTGTCTGTACTGCGATCAT 
TCAGACTGCTTAGAGTTTTCAAGTTGGCAAAATCCTGGCCCACACTAAATATGCTAATTAAGATCATTGGCAATTCT 
GTGGGGGCTCTAGGAAACCTCACCTTGGTGTTGGCCATCATCGTCTTCATTTTTGCTGTGGTCGGCATGCAGCTCTT 
TGGTAAGAGCTACAAAGAATGTGTCTGCAAGATCAATGATGACTGTACGCTCCCACGGTGGCACATGAACGACTTCT 
TCCACTCCTTCCTGATTGTGTTCCGCGTGCTGTGTGGAGAGTGGATAGAGACCATGTGGGACTGTATGGAGGTCGCT 
GGCCAAACCATGTGCCTTATTGTTTTCATGTTGGTCATGGTCATTGGAAACCTTGTGGTTCTGAACCTCTTTCTGGC 
CTTATTGTTGAGTTCATTTAGCTCAGACAACCTTGCTGCTACTGATGATGACAATGAAATGAATAATCTGCAGATTG 
CAGTAGGAAGAATGCAAAAGGGAATTGATTATGTGAAAAATAAGATGCGGGAGTGTTTCCAAAAAGCCTTTTTTAGA 
AAGCCAAAAGTTATAGAAATCCATGAAGGCAATAAGATAGACAGCTGCATGTCCAATAATACTGGAATTGAAATAAG 
CAAAGCGCTTAATTATCTTAGAGATGGGAATGGAACCACCAGTGGTGTAGGTACTGGAAGCAGTGTTGAAAAATACG 
TAATCGATGAAAATGATTATATGTCATTCATAAACAACCCCAGCCTCACCGTCACAGTGCCAATTGCTGTTGGAGAG 
TCTGACTTTGAAAACTTAAATACTGAAGAGTTCAGCAGTGAGTCAGAACTAGAAGAAAGCAAAGAGAAATTAAATGC 
AACCAGCTCATCTGAAGGAAGCACAGTTGATGTTGTTCTACCCCGAGAAGGTGAACAAGCTGAAACTGAACCCGAAG 
AAGACCTTAAACCGGAAGCTTGTTTTACTGAAGGATGTATTAAAAAGTTTCCATTCTGTCAAGTAAGTACAGAAGAA 
GGCAAAGGGAAGATCTGGTGGAATCTTCGAAAAACCTGCTACAGTATTGTTGAGCACAACTGGTTTGAGACTTTCAT 
TGTGTTCATGATCCTTCTCAGTAGTGGTGCATTGGCCTTTGAAGATATATACATTGAACAGCGAAAGACTATCAAAA 
CCATGCTAGAATATGCTGACAAAGTCTTTACCTATATATTCATTCTGGAAATGCTTCTCAAATGGGTTGCTTATGGA 
TTTCAAACATATTTCACTAATGCCTGGTGCCGGCTAGATTTCTTGATCGTTGATGTTTCTTTGGTTAGCCTGGTAGC 
CAATGCTCTTGGCTACTCAGAACTCGGTGCCATCAAATCATTACGGACATTAAGAGCTTTAAGACCTCTAAGAGCCT 
TATCCCGGTTTGAAGGCATGAGGGTGGTTGTGAATGCTCTTGTTGGAGCAATTCCCTCTATCATGAATGTGCTGTTG 
GTCTGTCTCATCTTCTGGTTGATCTTTAGCATCATGGGTGTGAATTTGTTTGCTGGCAAGTTCTACCAC 
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TGTGTTAACATGACAACGGGTAACATGTTTGACATTAGTGATGTTAACAATTTGAGTGACTGTCAGGCTCTTGGCAA 
GCAAGCTCGGTGGAAAAACGTGAAAGTAAACTTTGATAATGTTGGCGCTGGCTATCTTGCACTGCTTCAAGTGGTAAGTG 
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AGCGAAGCGGAGGCATAAGCAGAGAGGATTCTGGAAAGGTCTCTTTGTTTTCTTATCCACAGAGAAAGAAAGAAAAA 
AAATTGTAACTAATTTGTAAACCTCTGTGGTCAAAAAAAAAAAAAAAAAAAAAGCTGAACAGCTGCCAGAGGAAGAC 
ACGTTATACCCTAACCATCTTGGATGCTGGGCTTTGTTATGCTGTAATTCATAAGGCTCTGTTTTATCAGAGATTAT 
GGAGCAAGAAAACTGAAGCCAAGCCACATCAAGGTTTGACAGGGATGAGATACCTGTCAAGGATTCATAGTAGAGTG 
GCTTACTGGGAAAGGAGCAAAGAATCTCTTCTAGGGATATTGTAAGAATAAATGAGATAATTCACAGAAGGGACCTG 
GAGCTTTTCCGGAAAAAGGTGCTGTGACTATCTAAGGTAATTCGTATGCAAGAAGCTACACGTAATTAAATGTGCAG 
GATGAAAAGATGGCACAGGCACTGTTGGTACCCCCAGGACCTGAAAGCTTCCGCCTTTTTACTAGAGAATCTCTTGC 
TGCTATCGAAAAACGTGCTGCAGAAGAGAAAGCCAAGAAGCCCAAAAAGGAACAAGATAATGATGATGAGAACAAAC 
CAAAGCCAAATAGTGACTTGGAAGCTGGAAAGAACCTTCCATTTATTTATGGAGACATTCCTCCAGAGATGGTGTCA 
GAGCCCCTGGAGGACCTGGATCCCTACTATATCAATAAGAAAACTTTTATAGTAATGAATAAAGGAAAGGCAATTTT 
CCGATTCAGTGCCACCTCTGCCTTGTATATTTTAACTCCACTAAACCCTGTTAGGAAAATTGCTATCAAGATTTTGG 
TACATTCTTTATTCAGCATGCTTATCATGTGCACTATTTTGACCAACTGTGTATTTATGACCTTGAGCAACCCTCCT 
GACTGGACAAAGAATGTAGAGTACACATTCACTGGAATCTATACCTTTGAGTCACTTATAAAAATCTTGGCAAGAGG 
GTTTTGCTTAGAAGATTTTACGTTTCTTCGTGATCCATGGAACTGGCTGGATTTCAGTGTCATTGTGATGGCATATG 
TGACAGAGTTTGTGGACCTGGGCAATGTCTCAGCGTTGAGAACATTCAGAGTTCTCCGAGCACTGAAAACAATTTCA 
GTCATTCCAGGTTTAAAGACCATTGTGGGGGCCCTGATCCAGTCGGTAAAGAAGCTTTCTGATGTGATGATCCTGAC 
TGTGTTCTGTCTGAGCGTGTTTGCTCTCATTGGGCTGCAGCTGTTCATGGGCAATCTGAGGAATAAATGTTTGCAGT 
GGCCCCCAAGCGATTCTGCTTTTGAAACCAACACCACTTCCTACTTTAATGGCACAATGGATTCAAATGGGACATTT 
GTTAATGTAACAATGAGCACATTTAACTGGAAGGATTACATTGGAGATGACAGTCACTTTTATGTTTTGGATGGGCA 
AAAAGACCCTTTACTCTGTGGAAATGGCTCAGATGCAGGCCAGTGTCCAGAAGGATACATCTGTGTGAAGGCTGGTC 
GAAACCCCAACTATGGCTACACAAGCTTTGACACCTTTAGCTGGGCTTTCCTGTCTCTATTTCGACTCATGACTCAA 
GACTATTGGGAAAATCTTTACCAGTTGACATTACGTGCTGCTGGGAAAACATACATGATATTTTTTGTCCTGGTCAT 
TTTCTTGGGCTCATTTTATTTGGTGAATTTGATCCTGGCTGTGGTGGCCATGGCCTATGAGGAGCAGAATCAGGCCA 
CCTTGGAAGAAGCAGAACAAAAAGAGGCCGAATTTCAGCAGATGCTCGAACAGCTTAAAAAGCAACAGGAAGAAGCT 
CAGGCAGTTGCGGCAGCATCAGCTGCTTCAAGAGATTTCAGTGGAGTAGGTGGGTTAGGAGAGCTGTTGGAAAGTTC 
TTCAGAAGCATCAAAGTTGAGTTCCAAAGGTGCTAAAGAATGGAGGAACCGGAGGAAGAAAAGAAGACAGAGAGAGC 
ACCTTGAAGGAAACAACAAAGGAGAGAGAGACAGCTTTCCCAAATCCGAATCTGAAGACAGCGTCAAAAGAAGCAGC 
TTCCTTTTCTCCATGGATGGAAACAGACTGACCAGTGACAAAAAATTCTGCTCCCCTCATCAGTCTCTCTTGAGTAT 
CCGTGGCTCCCTGTTTTCCCCAAGACGCAATAGCAAAACAAGCATTTTCAGTTTCAGAGGTCGGGCAAAGGATGTTG 
GATCTGAAAATGACTTTGCTGATGATGAACACAGCACATTTGAAGACAGCGAAAGCAGGAGAGACTCACTGTTTGTG 
CCGCACAGACATGGAGAGCGACGCAACAGTAACGGCACCACCACTGAAACGGAAGTCAGAAAGAGAAGGTTAAGCTC 
TTACCAGATTTCAATGGAGATGCTGGAGGATTCCTCTGGAAGGCAAAGAGCCGTGAGCATAGCCAGCATTCTGACCA 
ACACAATGGAAGAACTTGAAGAATCTAGACAGAAATGTCCGCCATGCTGGTATAGATTTGCCAATGTGTTCTTGATC 
TGGGACTGCTGTGATGCATGGTTAAAAGTAAAACATCTTGTGAATTTAATTGTTATGGATCCATTTGTTGATCTTGC 
CATCACTATTTGCATTGTCTTAAATACCCTCTTTATGGCCATGGAGCACTACCCCATGACTGAGCAATTCAGTAGTG 
TGTTGACTGTAGGAAACCTGGTCTTTACTGGGATTTTCACAGCAGAAATGGTTCTCAAGATCATTGCCATGGATCCT 
TATTACTATTTCCAAGAAGGCTGGAATATCTTTGATGGAATTATTGTCAGCCTCAGTTTAATGGAGCTTGGTCTGTC 
AAATGTGGAGGGATTGTCTGTACTGCGATCATTCAGACTGCTTAGAGTTTTCAAGTTGGCAAAATCCTGGCCCACAC 
TAAATATGCTAATTAAGATCATTGGCAATTCTGTGGGGGCTCTAGGAAACCTCACCTTGGTGTTGGCCATCATCGTC 
TTCATTTTTGCTGTGGTCGGCATGCAGCTCTTTGGTAAGAGCTACAAAGAATGTGTCTGCAAGATCAATGATGACTG 
TACGCTCCCACGGTGGCACATGAACGACTTCTTCCACTCCTTCCTGATTGTGTTCCGCGTGCTGTGTGGAGAGTGGA 
TAGAGACCATGTGGGACTGTATGGAGGTCGCTGGCCAAACCATGTGCCTTATTGTTTTCATGTTGGTCATGGTCATT 
GGAAACCTTGTGGTTCTGAACCTCTTTCTGGCCTTATTATTGAGTTCATTTAGCTCAGACAACCTTGCTGCTACTGA 
TGATGACAATGAAATGAATAATCTGCAGATTGCAGTAGGAAGAATGCAAAAGGGAATTGATTATGTGAAAAATAAGA 
TGCGGGAGTGTTTCCAAAAAGCCTTTTTTAGAAAGCCAAAAGTTATAGAAATCCATGAAGGCAATAAGATAGACAGC 
TGCATGTCCAATAATACTGGAATTGAAATAAGCAAAGAGCTTAATTATCTTAGAGATGGGAATGGAACCACCAGTGG 
TGTAGGTACTGGAAGCAGTGTTGAAAAATACGTAATCGATGAAAATGATTATATGTCATTCATAAACAACCCCAGCC 
TCACCGTCACAGTGCCAATTGCTGTTGGAGAGTCTGACTTTGAAAACTTAAATACTGAAGAGTTCAGCAGTGAGTCA 
GAACTAGAAGAAAGCAAAGAGAAATTAAATGCAACCAGCTCATCTGAAGGAAGCACAGTTGATGTTGTTCTACCCCG 
AGAAGGTGAACAAGCTGAAACTGAACCCGAAGAAGACTTTAAACCGGAAGCTTGTTTTACTGAAGGGTGTATTAAAA 
AGTTTCCATTCTGTCAAGTAAGTACAGAAGAAGGCAAAGGGAAGATCTGGTGGAATCTTCGAAAAACCTGCTACAGT 
ATTGTTGAGCACAACTGGTTTGAGACTTTCATTGTGTTCATGATCCTTCTCAGTAGTGGTGCATTGGCCTTTGAAGA 
TATATACATTGAACAGCGAAAGACTATCAAAACCATGCTAGAATATGCTGACAAAGTCTTTACCTATATATTCATTC 
TGGAAATGCTTCTCAAATGGGTTGCTTATGGATTTCAAACATATTTCACTAATGCCTGGTGCTGGCTAGATTTCTTG 
ATCGTTGATGTTTCTTTGGTTAGCCTGGTAGCCAATGCTCTTGGCTACTCAGAACTCGGTGCCATCAAATCATTACG 
GACATTAAGAGCTTTAAGACCTCTAAGAGCCTTATCCCGGTTTGAAGGCATGAGGGTGGTTGTGAATGCTCTTGTTG 
GAGCAATTCCCTCTATCATGAATGTGCTGTTGGTCTGTCTCATCTTCTGGTTGATCTTTAGCATCATGGGTGTGAAT 
TTGTTTGCTGGCAAGTTCTACCACTGTGTTAACATGACAACGGGTAACATGTTTGACATTAGTGATGTTAACAATTT 
GAGTGACTGTCAGGCTCTTGGCAAGCAAGCTCGGTGGAAAAACGTGAAAGTAAACTTTGATAATGTTGGCGCTGGCT 
ATCTTGCACTGCTTCAAGTGGCCACATTTAAAGGCTGGATGGATATTATGTATGCAGCTGTTGATTCACGAGATGTT 
AAACTTCAGCCTGTATATGAAGAAAATCTGTACATGTATTTATACTTTGTCATCTTTATCATCTTTGGGTCATTCTT 
CACTCTGAATCTATTCATTGGTGTCATCATAGATAACTTCAACCAGCAGAAAAAGAAGTTTGGAGGTCAAGACATCT 
TTATGACAGAGGAACAGAAAAAATATTACAATGCAATGAAGAAACTTGGATCCAAGAAACCTCAGAAACCCATACCT 
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CGCCCAGCAAACAAATTCCAAGGAATGGTCTTTGATTTTGTAACCAGACAAGTCTTTGATATCAGCATCATGATCCT 
CATCTGCCTCAACATGGTCACCATGATGGTGGAAACGGATGACCAGGGCAAATACATGACCCTAGTTTTGTCCCGGA 
TCAACCTAGTGTTCATTGTTCTGTTCACTGGAGAATTTGTGCTGAAGCTCGTTTCCCTCAGACACTACTACTTCACT 
ATAGGCTGGAACATCTTTGACTTTGTGGTGGTGATTCTCTCCATTGTAGGTATGTTTCTGGCTGAGATGATAGAAAA 
GTATTCTGTGTCCCCTACCTTGTTCCGAGTGATCCGTCTTGCCAGGATTGGCCGAATCCTACGTCTGATCAAAGGAG 
CAAAGGGGATCCGCACGCTGCTCTTTGCTTTGATGATGTCCCTTCCTGCGTTGTTTAACATCGGCCTCCTGCTCTTC 
CTGGTCATGTTTATCTATGCCATCTTTGGGATGTCCAACTTTGCCTATGTTAAAAAGGAAGCTGGAATTGATGACAT 
GTTCAACTTTGAGACCTTTGGCAACAGCATGATCTGCTTGTTCCAAATTACAACCTCTGCTGGCTGGGATGGATTGC 
TAGCACCTATTCTTAATAGTGCACCACCCGACTGTGACCCTGACACAATTCACCCTGGCAGCTCAGTTAAGGGAGAC 
CGTGGGGACCCATCTGTTGGGATTTTCTTTTTTGTCAGTTACATCATCATATCCTTCCTGGTTGTGGTGAACATGTA 
CATCGCGGTCATCCTGGAGAACTTCAGTGTTGCTACTGAAGAAAGTGCAGAGCCCCTGAGTGAGGATGACTTTGAGA 
TGTTCTATGAGGTTTGGGAAAAGTTTGATCCCGATGCGACCCAGTTTATAGAGTTCTCTAAACTCTCTGATTTTGCA 
GCTGCCCTGGATCCTCCTCTTCTCATAGCAAAACCCAACAAAGTCCAGCTTATTGCCATGGATCTGCCCATGGTCAG 
TGGTGACCGGATCCACTGTCTTGATATTTTATTTGCCTTTACAAAGCGTGTTTTGTGTGAGAGTGGAGAGATGGATG 
CCCTTCGAATACAGATGGAAGACAGGTTTATGGCATCAAACCCCTCCAAAGTCTCTTATGAGCCTATTACAACCACT 
TTGAAACGTAAACAAGAGGAGGTGTCTGCCGCTATCATTCAGCGTAATTTCAGATGTTATCTTTTAAAGCAAAGGTT 
AAAAAATATATCAAGTAACTATAACAAAGAGGCAATTAAAGGGAGGATTGACTTACCTATAAAACAAGACATGATTA 
TTGACAAACTAAATGGGAACTCCACTCCAGAAAAAACAGATGGGAGTTCCTCTACCACCTCTCCTCCTTCCTATGAT 
AGTGTAACAAAACCAGACAAGGAAAAGTTTGAGAAAGACAAACCAGAAAAAGAAAGCAAAGGAAAAGAGGTCAGAGA 
AAATCAAAAGTAAAAAGAAACAAAGAATTATCTTTGTGATCAATTGTTTACAGCCTATGAAGGTAAAGTATATGTGT 
CAACTGGACTTCAAGAGGAGGTCCATGCCAAACTGACTGTTTTAACAAATACTCATAGTCAGTGCCTATACAAGACA 
GTGAAGTGACCTCTCTGTCACTGCAACTCTGTGAAGCAGGGTATCAACGTTGACAAGAGGTTGCTGTTTTTATTACC 
AGCTGACACTGCTGAGGAGAAACCCAATGGCTACCTAGACTATAGGGATAGTTGTGCAAAGTGAACATTGTAACTAC 
ACCAAACACCTTTAGTACAGTCCTTGCATCCATTCTATTTTTAACTTCCATATCTGCCATATTTTTACAAAATTTGT 
TCTAGTGCATTTCCATGGTCCCCAATTCATAGTTTATTCATAATGCTATGTCACTATTTTTGTAAATGAGGTTTACG 
TTGAAGAAACAGTATACAAGAACCCTGTCTCTCAAATGATCAGACAAAGGTGTTTTGCCAGAGAGATAAAATTTTTG 
CTCAAAACCAGAAAAAGAATTGTAATGGCTACAGTTTCAGTTACTTCCATTTTCTAGATGGCTTTAATTTTGAAAGT 
ATTTTAGTCTGTTATGTTTGTTTCTATCTGAACAGTTATGTGCCTGTAAAGTCTCCTCTAATATTTAAAGGATTATT 
TTTATGCAAAGTATTCTGTTTCAGCAAGTGCAAATTTTATTCTAAGTTTCAGAGCTCTATATTTAATTTAGGTCAAA 
TGCTTTCCAAAAAGTAATCTAATAAATCCATTCTAGAAAAATATATCTAAAGTATTGCTTTAGAATAGTTGTTCCAC 
TTTCTGCTGCAGTATTGCTTTGCCATCTTCTGCTCTCAGCAAAGCTGATAGTCTATGTCAATTAAATACCCTATGTT 
ATGTAAATAGTTATTTTATCCTGTGGTGCATGTTTGGGCAAATATATATATAGCCTGATAAACAACTTCTATTAAAT 
CAAATATGTACCACAGTGTATGTGTCTTTTGCAAGCTTCCAACAGGGATGTATCCTGTATCATTCATTAAACATAGT 
TTAAAGGCTATCACTAATGCATGTTAATATTGCCTATGCTGCTCTATTTTACTCAATCCATTCTTCACAAGTCTTGG 
TTAAAGAATGTCACATATTGGTGATAGAATGAATTCAACCTGCTCTGTCCATTATGTCAAGCAGAATAATTTGAAGC 
TATTTACAAACACCTTTACTTTTGCACTTTTAATTCAACATGAGTATCATATGGTATCTCTCTGGATTTCAAGGAAA 
CACACTGGATACTGCCTACTGACAAAACCTATTCTTCATATTTTGCTAAAAATATGTCTAAAACTTGTTTAAATATA 
AATAATGTAAAAATATAATCAACTTTATTTGTCAGCATTTTGTACATAAGAAAATTATTTTCAGGTTGATGACATCA 
CAATTTATTTTACTTTATGCTTTTGCTTTTGATTTTTAATCACAATTCCAAACTTTTGAATCCATAAGATTTTTCAA 
TGGATAATTTCCTAAAATAAAAGTTAGATAATGGGTTTTATGGATTTCTTTGTTATAATATATTTTCTACCATTCCA 
ATAGGAGATACATTGGTCAAACACTCAAACCTAGATCATTTTCTACCAACTATGGTTGCCTCAATATAACCTTTTAT 
TCATAGATGTTTTTTTTTATTCAACTTTTGTAGTATTTACGTATGCAGACTAGTCTTATTTTTTTAATTCCTGCTGC 
ACTAAAGCTATTACAAATATAACATGGACTTTGTTCTTTTTAGCCATGAACAAAGTGGCAAAGTTGTGCAATTACCT 
AACATGATATAAATTTTTGTTTTTTGCACAAACCAAAAGTTTAATGTTAATTCTTTTTACAAAACTATTTACTGTAG 
TGTATTGAAGAACTGCATGCAGGGAATTGCTATTGCTAAAAAGAATGGTGAGCTACGTCATTATTGAGCCAAAAGAA 
TAAATTTCATTTTTTATTGCATTTCACTTATTGGGCTCTGGGGTTTTTTGTTTTTGTTTTTTGCTGTTGGCAGTTTA 
AAATATATATAATTAATAAAACCTGTGCTTGATCTGACATTTGTATACATAAAAGTTTACATGAATTTTACAACAAA 
CTAGTGCATGATTCACCAAGCAGTACTACAGAACAAAGGCAAATTAAAAGCAGCTTTGTGAACTTTTATGTGTGCAA 
AGGATCAAGTTCACATGTTCCAACTTTCAGGTTTGATAATAATAGTAGTAACCACCTACAATAGCTTTCAATTTCAA 
TTAACTCCCTTGGCTATAAGCATCTAAACTCATCTTCTTTCAATATAATTGATGCTATCTCCTAATTACTTGGTGGC 
TAATAAATGTTACATTCTTTGTTACTTAAATGCATTATATAAACTCCTATGTATACATAAGGTATTAATGATATAGT 
TATTGAGAATTTATATTAACTTTTTTTTCAAGAACCCTTGGATTTATGTGAGGTCAAAACCAAACTCTTATTCTCAG 
TGGAAAACTCCAGTTGTAATGCATATTTTTAAAGACAATTTGGATCTAAATATGTATTTCATAATTCTCCCATAATA 
AATTATATAAGGTGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 80 

AAAAAAATTAAAAAAAAAAAAAAAAACAGAAAAAAAAACATAGTACATGCCAAGATATTATTATGACAATTACAAAT 
ACAAATAAATTATGATCTTTGACCTCAGCATATTTATTAACTAAAAGGGAAGATAAAACAGGCACATAACTATAACA 
GGGGCACCAGTCATGGGCGCCGCAGCCGCTCAGGCGCCTCTCGGGCTGCCTGCGGCCTCCGCTCGCCTTCTGCTGCT 
AGCGACGTCGGTGCTTCTGCTCTTCGCCTTCTCTCTGCCCGGGAGCCGCGCGTCCAACCAGCCCCCGGGTGGTGGCG 
GCGGCACGGGCGGGGACTGTCCCGGCGGCAAAGGCAAGAGCATCAACTGCTCAGAATTAAATGTGAGGGAGTCTGAC 
GTAAGAGTTTGTGATGAGTCATCATGTAAATATGGAGGAGTCTGTAAAGAAGATGGAGATGGTTTGAAATGTGCATG 
CCAATTTCAGTGCCATACAAATTATATTCCTGTCTGTGGATCAAATGGGGACACTTATCAAAATGAATGCTTTCTCA 
GAAGGGCTGCTTGTAAGCACCAGAAAGAGATAACAGTAATAGCAAGAGGACCATGCTACTCTGATAATGGATCTGGA 
TCTGGAGAAGGAGAAGAGGAAGGGTCAGGGGCAGAAGTTCACAGAAAACACTCCAAGTGTGGACCCTGCAAATATAA 
AGCTGAGTGTGATGAAGATGCAGAAAATGTTGGGTGTGTATGTAATATAGATTGCAGTGGATACAGTTTTAATCCTG 
TGTGTGCTTCTGATGGGAGTTCCTATAACAATCCCTGTTTTGTTCGAGAAGCATCTTGTATAAAGCAAGAACAAATT 
GATATAAGGCATCTTGGTCATTGCACAGATACAGATGACACTAGTTTGTTGGGAAAGAAAGATGATGGACTACAATA 
TCGACCAGATGTGAAAGATGCTAGTGATCAAAGAGAAGATGTTTATATTGGAAACCACATGCCTTGCCCTGAAAACC 
TCAATGGTTACTGCATCCATGGAAAATGTGAATTCATCTATCTACTCAGAAGGGCTTCTTGTAGATGTGAATCTGGC 
TACACTGGACAGCACTGTGAAAAGACAGACTTTAGTATTCTCTATGTAGTGCCAAGTAGGCAAAAGCTCACTCATGT 
TCTTATTGCAGCAATTATTGGAGCTGTACAGATTGCCATCATAGTAGCAATTGTAATGTGCATAACAAGAAAATGCC 
CCAAAAACAATAGAGGACGTCGACAGAAGCAAAACCTAGGTCATTTTACTTCAGATACGTCATCCAGAATGGTTTAA 
ACTGATGACTTTTATATGTACACTGACCATGTGTATGTACATTTATTATGTCTTTTTTTAAAGAATGGAAATATTTA 
TTTCAGAAGGCCTTATTTTTGGACATTTTATAGTGTAGTACTGTTGGCTCGATATTTGAATATTCAGCTACGACAGT 
TTTGGACTGTTTAGTAGTCTTTGTTTTATGTTTTTAAATACAGAAATTGCTTCACAAATTTGTACCACATGGTAATT 
CTAAGACTTGTTCTTTACCCATGGAATGTAATATTTTTGCAAAGATGGACTACTTCACAAATGGTTATAAAGTCATA 
TCCACTTCTTCCACAATGACCACAGCAAATGACCCAAGCATGAACTAAAGAAGAG 
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CGCCAGCATGCTGCCGGAGCAGCTCTACTTCCTGCAGAGCCCCCCGGAGGAGGAGCCCGAATACCACCCCGACGCCT 
CAGCCCAAGAATTAAATGTGAGGGAGTCTGACGTAAGAGTTTGTGATGAGTCATCATGTAAATATGGAGGAGTCTGT 
AAAGAAGATGGAGATGGTTTGAAATGTGCATGCCAATTTCAGTGCCATACAAATTATATTCCTGTCTGTGGATCAAA 
TGGGGACACTTATCAAAATGAATGCTTTCTCAGAAGGGCTGCTTGTAAGCACCAGAAAGAGATAACAGTAATAGCAA 
GAGGACCATGCTACTCTGATAATGGATCTGGATCTGGAGAAGGAGAAGAGGAAGGGTCAGGGGCAGAAGTTCACAGA 
AAACACTCCAAGTGTGGACCCTGCAAATATAAAGCTGAGTGTGATGAAGATGCAGAAAATGTTGGGTGTGTATGTAA 
TATAGATTGCAGTGGATACAGTTTTAATCCTGTGTGTGCTTCTGATGGGAGTTCCTATAACAATCCCTGTTTTGTTC 
GAGAAGCATCTTGTATAAAGCAAGAACAAATTGATATAAGGCATCTTGGTCATTGCACAGATACAGATGACACTAGT 
TTGTTGGGAAAGAAAGATGATGGACTACAATATCGACCAGATGTGAAAGATGCTAGTGATCAAAGAGAAGATGTTTA 
TATTGGAAACCACATGCCTTGCCCTGAAAACCTCAATGGTTACTGCATCCATGGAAAATGTGAATTCATCTATTCTA 
CTCAGAAGGCTTCTTGTAGATGTGAATCTGGCTACACTGGACAGCACTGTGAAAAGACAGACTTTAGTATTCTCTAT 
GTAGTGCCAAGTAGGCAAAAGCTCACTCATGTTCTTATTGCAGCAATTATTGGAGCTGTACAGATTGCCATCATAGT 
AGCAATTGTAATGTGCATAACAAGAAAATGCCCCAAAAACAATAGAGGACGTCGACAGAAGCAAAACCTAGGTCATT 
TTACTTCAGATACGTCATCCAGAATGGTTTAAACTGATGACTTTTATATGTACACTGACCATGTGATGTACATTTAT 
TATGTCTTTTTTTAAAGAATGGAAATATTTATTTCAGAGGCCTTATTTTTGGACATTTTTAGTGTAGTACTGTTGGC 
TCGTATTTAGAATATTCAGCTACGACAGTTTTGGACTGTTTAGTAGTCTTTGTTTTATGTTTTTAAATACAGAAATT 
GCTTTCACAAATTTGTACCACATGGTAATTCTAAGACTTGTTCTTTACCCATGGAATGTAATATTTTTGCAAAGATG 
GACTACTTCACAAATGGTTATAAAGTCATATCCACTTCTTCCACAATGACCACAGCAAATGACCAAGCATGAACTAA 
AGGTAAAGATGTTTACAGATTACTTTTCTTACAAAAAAAATCTAGAAGACACTGTGTTTAAATAGATATTTAAATGT 
TTTTGAGATTTAGTAACTGATTTTTTAGACACTGCCTATCGCATGAACTGTAAAGCTGTGTGTATTAGGTGTAAAAT 
ATTTATAAGATATATGGACTGGGGAATTTGATTATTCCTCCCTTTGAAAAAATAGTCCTAATAATTTGAACAAATAT 
GTTAGTAATGATGGAACAGATCAATGAAAAGTAGATATAGATATTGTGAAAATAGGCTGTTTAACAAACAGATTGGA 
ATAAAGCCTATTCTACCAGTTAAACTACTTTAATACACATTCATTTTTAAAGAAAATGTTTGTTTTAACATAAATAA 
ACAAATCGTATCAGTGTTTGTGAATAAAATACAAAAATGATTGTTAATGATTGGTGCTCTTAAAGTGAGCTTAAAAT 
TTATCCAAGACGTATATCCAAATTTGTCCTGTAGTAATAGATTAATATTCATAGATTGTTGGTGTTTAAAGATCTGA 
AGTGTGAGTAGAATGTATTCAGCTGTTTAACATGTAGTTTAGATATTCAAAAGTATGCATGTAGAATTTAAAGAATA 
TGT T AAAAAT T AT T AAT T TT AAT AT TT T GT T TGGAAAAGCATGT T AT AAT ATAATGTTTTCACT AT AAAGAAAAAAA 
AAAAAAAAAAAA 
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FIGURE 82 

MPTVDDILEQVGESGWFQKQAFLILCLLSAAFAPICVGIVFLGFTPDHHCQSPGVAELSQRCGWSPAEELNYTVPGL 
GPAGEAFLGQCRRYEVDWNQSALSCVDPLASLATNRSHLPLGPCQDGWVYDTPGSSIVTEFNLVCADSWKLDLFQSC 
LNAGFLFGSLGVGYFADRFGRKLCLLGTVLVNAVSGVLMAFSPNYMSMLLFRLLQGLVSKGNWMAGYTLITEFVGSG 
SRRTVAIMYQMAFTVGLVALTGLAYALPHWRWLQLAVSLPTFLFLLYYWCVPESPRWLLSQKRNTEAIKIMDHIAQK 
NGKLPPADLPCMLSLEEDVTEKLSPSFADLFRTPRLRKRTFILMYLWFTDSVLYQGLILHMGATSGNLYLDFLYSALV 
EI PGAFIALITIDRVGRI YPMAMSNLLAGAACLVMIFISPDLHWLNIIIMCVGRMGITIAIQMICLVNAELYPTFVR 
NLGVMVCSSLCDIGGIITPFIVFRLREVWQALPLILFAVLGLLAAGVTLLLPETKGVALPETMKDAENLGRKAKPKE 
NTIYLKVQTSEPSGT 

Transmembrane domains . 

amino acids 19-39, 175-195, 198-218, 235-255, 264-284, 350-370, 377-397, 
409-429, 457-477, 492-512 

N-glycosylation sites. 

amino acids 71-74, 96-99, 112-115 

cAMP- and cGMP-dependent protein kinase phosphorylation sites, 
amino acids 293-296, 344-347 

Tyrosine kinase phosphorylation site, 
amino acids 536-544 

N-myristoylation sites. 

amino acids 76-81, 190-195, 215-220, 231-236, 253-258, 369-374, 389-394, 
441-446, 465-470, 503-508 

Amidation sites. 

amino acids 173-176, 531-534 

Sugar (and other) transporter homology, 
amino acids 118-528 
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FIGURE 83 

MRSDKSALVFLLLQLFCVGCGFCGKVLVWPCDMSHWLNVKVILEELIVRGHEVTVLTHSKPSLIDYRKPSALKFEVV 
HMPQDRTEENEIFVDLALNVLPGLSTWQSVIKLNDFFVEIRGTLJ<MMCESFIYNQTLMKKLQETNYDVMLIDPVIPC 
GDLMAELLAVPFVLTLRISVGGNMERSCGKLPAPLSYVPVPMTGLTDRMTFLERVKNSMLSVLFHFWIQDYDYHFWE 
EFYSKALGRPTTLCETVGKAEIWLIRTYWDFEFPQPYQPNFEFVGGLHCKPAKALPKEMENFVQSSGEDGIVVFSLG 
SLFQNVTEEKANIIASALAQIPQKVLWRYKGKKPSTLGANTRLYDWIPQNDLLGHPKTKAFITHGGMNGIYEAIYHG 
VPMVGVPIFGDQLDNIAHMKAKGAAVEINFKTMTSEDLLRALRTVITDSSYKENAMRLSRIHHDQPVKPLDRAVFWI 
EFVMRHKGAKHLRSAAHDLTWFQHYSIDVIGFLLTCVATAIFLFTKCFLFSCQKFNKTRKIEKRE 

Signal sequence, 
amino acids 1-21 

Transmembrane domain, 
amino acids 489-509 

N-glycosylation sites. 

amino acids 131-134, 313-316, 518-521 

cAMP- and cGMP-dependent protein kinase phosphorylation sites, 
amino acids 67-70, 340-343 

Tyrosine kinase phosphorylation sites, 
amino acids 122-131, 136-144 

N-rayristoylation sites. 

amino acids 19-24, 276-281, 373-378, 377-382 

Amidation site, 
amino acids 338-341 
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MQADARAFFAQDGVQSLLTQKWWGDMASTTPIWALVLAFFCPPLIYTRLITFRKSEEEPTREELEFDMDSVINGEGP 
VGTADPAEKTPLGVPRQSGRPGCCGGRCGGRRCLRRWFHFWGAPVTIFMGNVVSYLLFLLLFSRVLLVDFQPAPPGS 
LELLLYFWAFTLLCEELRQGLSGGGGSLASGGPGPGHASLSQRLRLYLADSWNQCDLVALTCFLLGVGCRLTPGLYH 
LGRTVLCIDFMVFTVRLLHIFTVNKQLGPKIVIVSKMMKDVFFFLFFLGVWLVAYGVATEGLLRPRDSDFPSILRRV 
FYRPYLQIFGQIPQEDMDVALMEHSNCSSEPGFWAHPPGAQAGTCVSQYANWLWLLLVIFLLVANILLVNLLIAMF 
SYTFGKVQGNSDLYWKAQVTASSGNSTLGPRWPRPLSSSPTCASCSGNCAGDPGAPSRPPRPSSISGFTFLRKPSGS 
ANVGIGA 

Transmembrane domains . 

amino acids 26-46, 118-138, 203-223, 233-253, 269-289, 358-378 

N-glycosylation sites, 
amino acids 334-337, 410-413 

Glycosaminoglycan attachment site, 
amino acids 176-179 

cAMP- and cGMP-dependent protein kinase phosphorylation site, 
amino acids 457-460 

N-myristoylation sites. 

amino acids 99-104, 102-107, 106-111, 119-124, 127-132, 174-179, 177-182, 179- 
184, 180-185, 347-352, 351-356, 432-437 

Amidation site, 
amino acids 106-109 



Leucine zipper pattern, 
amino acids 211-232 
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FIGURE 85 

MTLWNGVLPFYPQPRHAAGFSVPLLIVILVFLALAASFLLILPGIRGHSRWFWLVRVLLSLFIGAEIVAVHFSAEWF 
VGTVNTNTSYKAFSAARVTARVGLLVGLEGINITLTGTPVHQLNETIDYNEQFTWRLKENYAAEYANALEKGLPDPV 
LYLAEKFTPSSPCGLYHQYHLAGHYASATLWVAFCFWLLSNVLLSTPAPLYGGLALLTTGAFALFGVFALASISSVP 
LCPLRLGSSALTTQYGAAFWVTLATGVLCLFLGGAWSLQYVRPSALRTLLDQSAKDCSQERGGSPLILGDPLHKQA 
ALPDLKCITTNL 

Transmembrane domains . 

amino acids 19-39, 54-74, 94-114, 178-198, 211-231, 251-271 

N-glycosylation sites. 

amino acids 84-87, 109-112, 121-124 

Tyrosine kinase phosphorylation site, 
amino acids 148-157 

N-myristoylation sites. 

amino acids 79-84, 100-105, 107-112, 265-270 

Leucine zipper pattern, 
amino acids 197-218 
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FIGURE 86 

MERLWGLFQRAQQLSPRSSQTVYQRVEGPRKGHLEEEEEDGEEGAETLAHFCPMELRGPEPLGSRPRQPNLIPWAAA 
GRRAAPYLVLTALLIFTGAFLLGYVAFRGSCQACGDSVLVVSEDVNYEPDLDFHQGRLYWSDLQAMFLQFLGEGRLE 
DTIRQTSLRERVAGSAGMAALTQDIRAALSRQKLDHVWTDTHYVGLQFPDPAHPNTLHWVDEAGKVGEQLPLEDPDV 
YCPYSAIGNVTGELVYAHYGRPEDLQDLRARGVDPVGRLLLVRVGVISFAQKVTNAQDFGAQGVLIYPEPADFSQDP 
PKPSLSSQQAVYGHVHLGTGDPYTPGFPSFNQTQFPPVASSGLPSIPAQPISADIASRLLRKLKGPVAPQEWQGSLL 
GSPYHLGPGPRLRLVVNNHRTSTPINNIFGCIEGRSEPDHYVVIGAQRDAWGPGAAKSAVGTAILLELVRTFSSMVS 
NGFRPRRSLLFISWDGGDFGSVGSTEWLEGYLSVLHLKAWYVSLDNAVLGDDKFHAKTSPLLTSLIESVLKQVDSP 
NHSGQTLYEQVVFTNPSWDAEVIRPLPMDSSAYSFTAFVGVPAVEFSFMEDDQAYPFLHTKEDTYENLHKVLQGRLP 
AVAQAVAQLAGQLLIRLSHDRLLPLDFGRYGDVVLRHIGNLNEFSGDLKARGLTLQWVYSARGDYIRAAEKLRQEIY 
SSEERDERLTRMYNVRIMRVEFYFLSQYVSPADSPFRHIFMGRGDHTLGALLDHLRLLRSNSSGTPGATSSTGFQES 
RFRRQLALLTWTLQGAANALSGDVWNIDNNF 

Transmembrane domain, 
amino acids 79-99 

N-glycosylation sites. 

amino acids 240-243, 339-342, 540-543, 754-757 
N-myristoylation sites. 

amino acids 106-111, 239-244, 382-387, 415-420, 439-444, 478-483, 482-487, 757- 
762, 760-765, 785-790 

Amidation site, 
amino acids 77-80 

Cell attachment sequences, 
amino acids 678-680, 736-738 

PA domain. 

amino acids 236-335 
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FIGURE 87 

MVELMFPLLLLLLPFLLYMAAPQIRKMLSSGVCTSTVQLPGKVVVVTGANTGIGKETAKELAQRGARVYLACRDVEK 
GELVAKEIQTTTGNQQVLVRKLDLSDTKSIRAFAKGFLAEEKHLHVLINNAGVMMCPYSKTADGFEMHIGVNHLGHF 
LLTHLLLEKLKESAPSRIVNVSSLAHHLGRIHFHNLQGEKFYNAGLAYCHSKLANILFTQELARRLKGSGVTTYSVH 
PGTVQSELVRHSSFMRWMWWLFSFFIKTPQQGAQTSLHCALTEGLEILSGNHFSDCHVAWVSAQARNETIARRLWDV 
SCDLLGLPID 

Signal sequence, 
amino acids 1-20 

Transmembrane domain, 
amino acids 32-52 

N-glycosylation sites, 
amino acids 174-177, 298-301 

N-myristoylation sites. 

amino acids 31-36, 48-53, 199-204, 222-227, 233-238, 263-268, 281-286 

Short chain dehydrogenase, 
amino acids 40-303 
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FIGURE 88 

MKIFLPVLLAALLGVERASSLMCFSCLNQKSNLYCLKPTICSDQDNYCVTVSASAGIGNLVTFGHSLSKTCSPACPI 
PEGVNVGVASMGISCCQSFLCNFSAADGGLRASVTLLGAGLLLSLLPALLRFGP 

Signal sequence, 
amino acids 1-20 

Transmembrane domain, 
amino acids 108-128 

N-glycosylation site, 
amino acids 99-102 

N-myristoylation sites. 

amino acids 14-19, 58-63, 80-85, 89-94, 105-110, 106-111, 117-122 

u-PAR/Ly-6 domain, 
amino acids 21-100 
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FIGURE 89 

MLLLKKHTEDISSVYEIRERLGSGAFSEVVLAQERGSAHLVALKCIPKPCALRGKEALVENEIAVLRRISHPNIVALE 
DVHESPSHLYLAMELVTGGELFDRIMERGSYTEKDASHLVGQVLGAVSYLHSLGIVHRDLKPENLLYATPFEDSKIM 
VSDFGLSKIQAGNMLGTACGTPGYVAPELLEQKPYGKAVDVWALGVISYILLCGYPPFYDESDPELFSQILRASYEF 
DXPFWDDISESGKDFIRHLLERDLQKRFTCQQALRDLWIFWDTGFGRDILGFVSEQIRKNFAWTHWKRAFNATLFLR 
HIRKLGQIPEGEGASEQGMXRHSHXGLRAGQPPKW 

N-glycosylation site, 
amino acids 302-305 

cAMP- and cGMP-dependent protein kinase phosphorylation site, 
amino acids 5-8, 66-69, 257-260 

Tyrosine kinase phosphorylation site, 
amino acids 101-109 

N-myristoylation sites. 

amino acids 118-123, 166-171, 170-175, 334-339 

Serine/Threonine protein kinases active-site signature, 
amino acids 132-145 

Protein kinase domain, 
amino acids 15-270 



101/170 



FIGURE 90 

MEELQDDYEDMMEENLEQEEYEDPDIPESQMEEPAAHDTEATATDYHTTSHPGTHEVYVELQELVMDEKNQELRWME 
AARWVQLEENLGENGAWGRPHLSHLTFWSLLELRRVFTKGTVLLDLQETSLAGVANQLLDRFIFEDQIRPQDREELL 
RALLLKHSHAGELEALGGVKPAVLTRSGDPSQPLLPQHSSLETQLFCEQGDGGTEGHSPSGILEKIPPDSEATLVLV 
GRADFLEQPVLGFVRLQEAAELEAVELPVPIRFLFVLLGPEAPHIDYTQLGRAAATLMSERVFRIDAYMAQSRGELL 
HSLEGFLDCSLVLPPTDAPSEQALLSLVPVQRELLRRRYQSSPAKPDSSFYKGLDLNGGPDDPLQQTGQLFGGLVRD 
IRRRYPYYLSDITDAFSPQVLAAVIFIYFAALSPAITFGGLLGEKTRNQMGVSELLISTAVQGILFALLGAQPLLVV 
GFSGPLLVFEEAFFSFCETNGLEYIVGRVWIGFWLILLWLVVAFEGSFLVRFISRYTQEIFSFLISLI FIYETFSK 
LIKIFQDHPLQKTYNYNVLMVPKPQGPLPNTALLSLVLMAGTFFFAMMLRKFKNSSYFPGKLRRVIGDFGVPISILI 
MVLVDFFIQDTYTQKLSVPDGFKVSNSSARGWVIHPLGLRSEFPIWMMFASALPALLVFILIFLESQITTLIVSKPE 
RKMVKGSGFHLDLLLVVGMGGVAALFGMPWLSATTVRSVTHANALTVMGKASTPGAAAQIQEVKEQRISGLLVAVLV 
GLSILMEPILSRIPLAVLFGIFLYMGVTSLSGIQLFDRILLLFKPPKYHPDVPYVKRVKTWRMHLFTGIQIICLAVL 
WVVKSTPASLALPFVLILTVPLRRVLLPLIFRNVELQCLDADDAKATFDEEEGRDEYDEVAMPV 

Transmembrane domains . 

amino acids 401-421, 445-465, 487-507, 519-539, 564-584, 602-622, 663-683, 700- 
720, 761-781, 793-813, 831-851, 854-874 

N-glycosylation site. 

amino acids 593-596, 642-645 

Tyrosine kinase phosphorylation site, 
amino acids 817-824 

N-myristoylation sites. 

amino acids 172-177, 206-211, 361-366, 376-381, 424-429, 448-453, 609-614, 713- 
718, 763-768 

Anion exchangers family signature 1. 
amino acids 379-390 

Anion exchangers family signature 2. 
amino acids 526-540 
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FIGURE 91 

MVAWVVSTGVAIACCAAVYYLAEYNLEFLKTHSNPGAVLLLPFVVSCINLAVPCI YSMFRLVERYEMPRHEVYVLLI 
RNIFLKISIIGILCYYWLNTVALSGEECWETLIGQDIYRLLLMDFVFSLVNSFLGEFLRRIIGMQLITSLGLQEFDI 
ARNVLELIYAQTLVWIGIFFCPLLPFIQMIMLFIMFYSKNISLMMNFQPPSKAWRASQMMTFFIFLLFFPSFTGVLC 
TLAITIWRLKPSADCGPFRGLPLFIHSIYSWIDTLSTRPGYLWVVWIYRNLIGSVHFFFILTLIVLIITYLYWQITE 
GRKIMIRLLHEQIINEGKDKMFLIEKLIKLQDMEKKANPSSLVLERREVEQQGFLHLGEHDGSLDLRSRRSVQEGNPRA 

Transmembrane domains. 

amino acids 1-16, 34-54, 75-95, 114-134, 159-179, 175-195, 213-233, 250-270, 
281-301 

N-glycosylation site, 
amino acids 194-197 

Tyrosine kinase phosphorylation site, 
amino acids 156-163 

N-myristoylation sites, 
amino acids 9-14, 228-233 

Amidation site, 
amino acids 308-311 
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FIGURE 92 

MERRRLWGSIQSRYISMSVWTSPRRLVELAGQSLLKDEALAIAALELLPRELFPPLFMAAFDGRHSQTLKAMVQAWP 
FTCLPLGVLMKGQHLHLETFKAVLDGLDVLLAQEVRPRRWKLQVLDLRKNSHQDFWTVWSGNRASLYSFPEPEAAQP 
MTKKRKVDGLSTEAEQPFIPVEVLVDLFLKEGACDELFSYLIEKVKRKKNVLRLCCKKLKIFAMPMQDIKMILKMVQ 
LDSIEDLEVTCTWKLPTLAKFSPYLGQMINLRRLLLSHIHASSYISPEKEEQYIAQFTSQFLSLQCLQALYVDSLFF 
LRGRLDQLLRHVMNPLETLSITNCRLSEGDVMHLSQSPSVSQLSVLSLSGVMLTDVSPEPLQALLERASATLQDLVF 
DECGITDDQLLALLPSLSHCSQLTTLSFYGNSISISALQSLLQHLIGLSNLTHVLYPVPLESYEDIHGTLHLERLAY 
LHARLRELLCELGRPSMVWLSANPCPHCGDRTFYDPEPILCPCFMPN 

N-glycosylation site, 
amino acids 4 35-4 38 

cAMP- and cGMP-dependent protein kinase phosphorylation site, 
amino acids 125-128 

N-myristoylation sites. 

amino acids 8-13, 138-143, 257-262, 358-363, 415-418 
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FIGURE 93 

MTQRSIAGPICNLKFVTLLVALSSELPFLGAGVQLQDNGYNGLLIAINPQVPENQNLISNIKEMITEASFYLFNATK 
RRVFFRNIKILI PATWKANNNSKIKQESYEKANVIVTDWYGAHGDDPYTLQYRGCGKEGKYIHFTPNFLLNDNLTAG 
YGSRGRVFVHEWAHLRWGVFDEYINDKPFYINGQNQIKVTRCSSDITGIFVCEKGPCPQENCIISKLFKEGCTFIYN 
STQNATASIMFMQSLSSWEFCNASTHNQEAPNLQNQMCSLRSAWDVITDSADFHHSFPMNGTELPPPPTFSLVQAG 
DKVVCLVLDVSSKMAEADRLLQLQQAAEFYLMQIVEIHTFVGIASFDSKGEIRAQLHQINSNDDRKLLVSYLPTTVS 
AKTDI SICSGLKKGFEVVEKLNGKAYGSVMILVTSGDDKLLGNCLPTVLSSGSTIHSIALGSSAAPNLEELSRLTGG 
LKFFVPDISNSNSMIDAFSRISSGTGDIFQQHIQLESTGENVKPHHQLKNTVTVDNTVGNDTMFLVTWQASGPPEII 
LFDPDGRKYYTNNFITNLTFRTASLWIPGTAKPGHWTYTLNNTHHSLQALKVTVTSRASNSAVPPATVEAFVERDSL 
HFPHPVMIYANVKQGFYPILNATVTATVEPETGDPVTLRLLDDGAGADVIKNDGIYSRYFFSFAANGRYSLKVHVNH 
SPSISTPAHSIPGSHAMYVPGYTANGNIQMNAPRKSVGRNEEERKWGFSRVSSGGSFSVLGVPAGPHPDVFPPCKII 
DLEAVKVEEELTLSWTAPGEDFDQGQATSYEIRMSKSLQNIQDDFNNAILVNTSKRNPQQAGIREIFTFSPQISTNG 
PEHQPNGETHESHRIYVAIRAMDRNSLQSAVSNIAQAPLFIPPNSDPVPARDYLILKGVLTAMGLIGIICLIIVVTH 
HTLSRKKRADKKENGTKLL 

Signal sequence, 
amino acids 1-31 

Transmembrane domain, 
amino acids 903-923 

N-glycosylation sites. 

amino acids 74-77, 97-100, 150-153, 231-234, 235-238, 254-257, 292-295, 522-525, 
556-559, 580-583, 637-640, 822-825, 938-941 

Glycosaminoglycan attachment site, 
amino acids 485-488 

N-myristoylation sites. 

amino acids 42-47, 202-207, 395-400, 747-752, 795-800, 905-910 

Amidation site, 
amino acids 544-547 
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FIGURE 94 



MALNSGSPPAIGPYYENHGYQPENPYPAQPTVVPTVYEVHPAQYYPSPVPQYAPRVLTQASNPVVCTQPKSPSGTVC 
TSKTKKALCITLTLGTFLVGAALAAGLLWKFMGSKCSNSGIECDSSGTCINPSNWCDGVSHCPGGEDENRCVRLYGP 
NFILQMYSSQRKSWHPVCQDDWNENYGRAACRDMGYKNNFYSSQGIVDDSGSTSFMKLNTSAGNVDIYKKLYHSDAC 
SSKAVVSLRCLACGVNLNSSRQSRIVGGESALPGAWPWQVSLHVQNVHVCGGSIITPEWIVTAAHCVEKPLNNPWHW 
TAFAGILRQSFMFYGAGYQVQKVISHPNYDSKTKNNDIALMKLQKPLTFNDLVKPVCLPNPGMMLQPEQLCWISGWG 
ATEEKGKTSEVLNAAKVLLIETQRCNSRYVYDNLITPAMICAGFLQGNVDSCQGDSGGPLVTSNNNIWWLIGDTSWG 
SGCAKAYRPGVYGNVMVFTDWIYRQMKANG 

Transmembrane domain, 
amino acids 86-106 

N-glycosylation sites, 
amino acids 213-216, 249-252 

Glycosaminoglycan attachment site, 
amino acids 382-385 

N-myristoylation sites. 

amino acids 6-11, 74-79, 97-102, 110-115, 245-250, 258-263, 432-437, 462-467, 
464-469, 472-477 

ATP/GTP-binding site motif A (P-loop) . 
amino acids 386-393 

Serine proteases, trypsin family, histidine active site, 
amino acids 292-297 



Trypsin homology, 
amino acids 256-484 
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FIGURE 95 

MARSLLLPLQILLLSLALETAGEEAQGDKIIDGAPCARGSHPWQVALLSGNQLHCGGVLVNERWVLTAAHCKMNEYT 
VHLGSDTLGDRRAQRIKASKSFRHPGYSTQTHVNDLMLVKLNSQARLSSMVKKVRLPSRCEPPGTTCTVSGWGTTTS 
PDVTFPSDLMCVDVKLISPQDCTKVYKDLLENSMLCAGIPDSKKNACNGDSGGPLVCRGTLQGLVSWGTFPCGQPND 
PGVYTQVCKFTKWINDTMKKHR 

Signal sequence, 
amino acids 1-15 

N-glycosylation site, 
amino acids 246-249 

Glycosaminoglycan attachment site, 
amino acids 147-150 

N-myristoylation sites. 

amino acids 33-38, 57-62, 141-146, 192-197, 213-218, 222-227 

Serine proteases, trypsin family, histidine active site, 
amino acids 66-71 

Trypsin homology, 
amino acids 30-245 
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FIGURE 96 

MESRKDITNQEELWKMKPRRNLEEDDYLHKDTGETSMLKRPVLLHLHQTAHADEFDCPSELQHTQELFPQWHLPIKI 
AAIIASLTFLYTLLREVIHPLATSHQQYFYKIPILVINKVLPMVSITLLALVYLPGVIAAIVQLHNGTKYKKFPHWL 
DKWMLTRKQFGLLSFFFAVLHAIYSLSYPMRRSYRYKLLNWAYQQVQQNKEDAWIEHDVWRMEIYVSLGIVGLAILA 
LLAVTSIPSVSDSLTWREFHYIQSKLGIVSLLLGTIHALIFAWNKWIDIKQFVWYTPPTFMIAVFLPIVVLIFKSIL 
FLPCLRKKILKIRHGWEDVTKINKTEICSQL 

Transmembrane domains . 

amino acids 72-92, 116-136, 158-178, 218-238, 254-274, 291-311 

N-glycosylation sites, 
amino acids 143-146, 331-334 

Tyrosine kinase phosphorylation site, 
amino acids 19-27 

N-myristoylation sites, 
amino acids 133-138, 265-270 
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FIGURE 97 

MDSSPSLPLIRTPESSLHEALDQCMTALDLFLTNQFSEALSYLKPRTKESMYHSLTYATILEMQAMMTFDPQDILLA 
GNMMKEAQMLCQRHRRKSSVTDSFSSLVNRPTLGQFTEEEIHAEVCYAECLLQRAALTFLQGSSHGGAVRPRALHDP 
SHACSCPPGPGRQHLFLLQDENMVSFIKGGIKVRNSYQTYKELDSLVQSSQYCKGENHPHFEGGVKLGVGAFNLTLS 
MLPTRILRLLEFVGFSGNKDYGLLQLEEGASGHSFRSVLCVMLLLCYHTFLTFVLGTGNVNIEEAEKLLKPYLNRYP 
KGAIFLFFAGRIEVIKGNIDAVSDGGPGRGWGSLGVSQTSRKSGTCDILRDRIDWGRGGGQERTNQRAGAGEALLAE 
QPGKTREEEAFVVPGILTGRYRTAALQWREVEGGA 

Transmembrane domain, 
amino acids 267-287 

N-glycosylation site, 
amino acids 227-230 

cAMP- and cGMP-dependent protein kinase phosphorylation sites, 
amino acids 92-95, 93-96 

Tyrosine kinase phosphorylation site, 
amino acids 4 6-52 

N-myristoylation sites. 

amino acids 139-144, 218-223, 325-330, 343-348, 368-373, 377-382, 400-405 

Eukaryotic putative RNA-binding region RNP-1 signature, 
amino acids 309-316 
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FIGURE 98 

MADAEVIILPKKHKKKKERKSLPEEDVAEIQHAEEFLIKPESKVAKLDTSQWPLLLKNFDKLNVRTTHYTPLACGSN 
PLKREIGDYIRTGFINLDKPSNPSSHEVVAWIRRILRVEKTGHSGTLDPKVTGCLIVCIERATRLVKSQQSAGKEYV 
GIVRLHNAIEGGTQLSRALETLTGALFQRPPLIAAVKRQLRVRTIYESKMIEYDPERRLGIFWVSCEAGTYIRTLCV 
HLGLLLGVGGQMQELRRVRSGVMSEKDHMVTMHDVLDAQWLYDNHKDESYLRRVVYPLEKLLTSHKRLVMKDSAVNA 
ICYGAKIMLPGVLRYEDGIEVNQEI VVITTKGEAICMAIALMTTAVISTCDHGIVAKIKRVIMERDTYPRKWGLGPK 
ASQKKLMIKQGLLDKHGKPTDSTPATWKQEYVDYSESAKKEVVAEVVKAPQVVAEAAKTAKRKRESESESDETPPAA 
PQLIKKEKKKSKKDKKAKAGLESGAEPGDGDSDTTKKKKKKKKAKEVELVSE 

Transmembrane domain, 
amino acids 220-240 

cAMP- and cGMP-dependent protein kinase phosphorylation sites, 
amino acids 448-451, 470-473 

Tyrosine kinase phosphorylation site, 
amino acids 368-376 

N-myristoylation sites. 

amino acids 166-171, 234-239, 326-331, 482-487, 486-491 

TruB family pseudouridylate synthase homology, 
amino acids 107-247 

PUA domain. 

amino acids 295-370 
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FIGURE 99 

MALRICVTYTPALPIGLCTRCCLCLEQSPSWCHCLRGVSFLTFHLHQSVPLGDRDSLLMFTRQAGHFVEGSKAGRSR 
GRLCLSQALRVAVRGAFVSLWFAAGAGDRERNKGDKGAQTGAGLSQEAEDVDVSRARRVTDAPQGTLCGTGNRNSGS 
QSARAVGVAHLGEAFRVGVEQAISSCPEEVHGRHGLSMEIMWARMDVALRSPGRGLLAGAGALCVTLAESSCPDYER 
GRRACLTLHRHPTPHCSTWGLPLRVAGSWLTVVTVEALGGWRMGVRRTGQVGPTMHPPPVSGASPLLLHHLLLLLLI 
IILTC 

Signal sequence, 
amino acids 1-34 

Transmembrane domains. 

amino acids 80-100, 250-270, 293-312 

cAMP- and cGMP-dependent protein kinase phosphorylation site, 
amino acids 134-137 

N-myristoylation sites. 

amino acids 70-75, 92-97, 114-119, 118-123, 142-147, 148-153, 153-158, 172-177, 
209-214, 258-263, 275-280 

Amidation site, 
amino acids 231-234 

Eukaryotic putative RNA-binding region RNP-1 signature, 
amino acids 36-43 
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FIGURE 100 



MGIVANFQELAVPVVHDGGALLAFVCGVVYTLLQSIISYKSCPQWNSLSTCHIRMVISAVSCAAVIPMIVCASLISI 
TKLEWNPREKDYVYHWSAICEWTVAFGFIFYFLTFIQDFQSVTLRISTEINGDI 

Transmembrane domains . 

amino acids 10-30, 53-73, 93-113 

Tyrosine kinase phosphorylation site, 
amino acids 85-91 

N-myristoylation sites, 
amino acids 2-7, 19-24, 27-32 
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FIGURE 101 

MPRQLSAAAAL FASLAVILHDGSQMRAKAFPETRDYSQPTAAATVQDIKKPVQQPAKQAPHQTLAARFMDGHITFQT 
AATVKIPTTTPATTKNTATTSPITYTLVTTQATPNNSHTAPPVTEVTVGPSLAPYSLPPTITPPAHTAGTSSSTVSH 
TTGNTTQPSNQTTLPATLSIALHKSTTGQKPDQPTHAPGTTAAAHNTTRTAAPASTVPGPTLAPQPSSVKTGIYQVL 
NGSRLCIKAEMGIQLIVQDKESVFSPRRYFNIDPNATQASGNCGTRKSNLLLNFQGGFVNLTFTKDEESYYISEVGA 
YLTVSDPETVYQGIKHAVVMFQTAVGHSFKCVSEQSLQLSAHLQVKTTDVQLQAFDFEDDHFGNVDECSSDYTIVLP 
VIGAIVVGLCLMGMGVYKIRLRCQSSGYQRI 

Signal sequence, 
amino acids 1-16 

Transmembrane domain, 
amino acids 380-400 

N-glycosylation sites. 

amino acids 112-115, 158-161, 164-167, 200-203, 232-235, 266-269, 291-294 

Tyrosine kinase phosphorylation sites, 
amino acids 28-36, 296-302 

N-myristoylation sites. 

amino acids 146-151, 193-198, 233-238, 272-277, 275-280, 287-292, 307-312, 321- 
326 

Lysosome-associated membrane glycoprotein homology, 
amino acids 66-416 
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FIGURE 102 

MTEEPIKEILGAPKAHMAATMEKSPKSEVVITTVPLVSEIQLiyiAATGGTELSCYRCIIPFAVVVFIAGIVVTAVAYS 
FNSHGSIISIFGLVVLSSGLFLLASSALCWKVRQRSKKAKRRESQTALVANQRSLFA 

Transmembrane domains, 
amino acids 54-74, 84-104 

cAMP- and cGMP-dependent protein kinase phosphorylation site, 
amino acids 118-121 

N-myristoylation sites. 

amino acids 11-16, 48-53, 68-73, 82-87 
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FIGURE 103 

MLSLLHASTLAVLGALCVYGAGHLEQPQISSTKTLSKTARLECVVSGITISATSVYWYRERPGEVIQFLVSISYDGT 
VRKESGIPSGKFEVDRIPETSTSTLTIHNVEKQDIATYYCALWEVRLANQELGKKIKVFGPGTKLIITDKQLDADVS 
PKPTIFLPSIAETKLQKAGTYLCLLEKFFPDVIKIHWQEKKSNTILGSQEGNTMKTNDTYMKFSWLTVPEKSLDKEH 
RCIVRHENNKNGVDQEIIFPPIKTDVITMDPKDNCSKDAN DTLLLQLTNTSAYYTYLLLLLKSVVYFAIITCCLLRR 
TAFCCNGEKS 

Signal sequence, 
amino acids 1-17 

Transmembrane domain, 
amino acids 282-302 

N-glycosylation sites. 

amino acids 211-214, 265-268, 271-274, 280-283 

cAMP- and cGMP-dependent protein kinase phosphorylation site, 
amino acids 7 9-82 

N-myristoylation sites. 

amino acids 47-52, 83-88, 173-178, 201-206 

Amidation site, 
amino acids 129-132 

Immunoglobulin domain, 
amino acids 36-119 
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FIGURE 104 

MNYSLHLAFVCLSLFTERMCIQGSQFNVEVGRSDKLSLPGFENLTAGYNKFLRPNFGGEPVQIALTLDIASISSISE 
SNMDYTATIYLRQRWMDQRLVFEGNKSFTLDARLVEFLWVPDTYIVESKKSFLHEVTVGNRLIRLFSNGTVLYALRI 
TTTVACNMDLSKYPMDTQTCKLQLESWGYDGNDVEFTWLRGNDSVRGLEHLRLAQYTIERYFTLVTRSQQETGNYTR 
LVLQFELRRNVLYFILETYVPSTFLVVLSWVSFWISLDSVPARTCIGVTTVLSMTTLMIGSRTSLPNTNCFIKAIDV 
YLGICFSFVFGALLEYAVAHYSSLQQMAAKDRGTTKEVEEVSITNIINSSISSFKRKISFASIEISSDNVDYSDLTM 
KTSDKFKFVFREKMGRIVDYFTIQNPSNVDHYSKLLFPLIFMLANVFYWAYYMYF 

Signal sequence, 
amino acids 1-16 

Transmembrane domains . 
amino acids 306-326, 415-435 

N-glycosylation sites. 

amino acids 2-5, 43-46, 102-105, 145-148, 196-199, 228-231, 356-259 

cAMP- and cGMP-dependent protein kinase phosphorylation site, 
amino acids 364-367 

Tyrosine kinase phosphorylation sites, 
amino acids 175-183, 221-229 

N-myristoylation sites. 

amino acids 23-28, 291-296, 311-316 

Neurotransmitter-gated ion-channels signature, 
amino acids 160-174 

Neurotransmitter-gated ion-channel ligand binding domain, 
amino acids 37-242 

Neurotransmitter-gated ion-channel transmembrane region, 
amino acids 24 9-434 
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FIGURE 105 

MILVLLCAFLIPCPPRDLHSTWSRHLGSQGGGDLSPLELADVNGDGLRDVLLSFVMSRNGSAVGVSRPAANLVCLSG 
MNGSTLWSSLLPEEARDITCLELMPGSLAETICLVTGTHKMLSAFNATSGKAIWTLNPNYLSNGTLAAPVVVLPDLD 
EDGVRDLVVLAIGELQPDLCFLLVSGRTGNPVGRPVKYNIVGVGNLIGPQVYITTNGAVYILFGFGNIQAVALRDIF 
VQAQNRDSSPPSLQIEEPEWEKRRSINLSELIDVYSDGVELLQMVKAPDSNCSNLLITTRQSLVLLRGQNLTPYWAL 
RLQGLRSQPTPGYFTDDQTLDFLLQIQDGVGMKKMMVVDGDSGSIVWSYRAPCHMKETPATSAVTSDQKSVFLFWAE 
GLSAASPNSDIILGTEPPSLHHLYLLHPAFPSILLDLANTTGTVTASEVGINDLWKDAFYVTRTTGPSSEGHPAALV 
VSKLSLRWALMEGQMAQLQESTPKIGRGELRRFLSRIKFVEAPYEI 

Signal sequence, 
amino acids 1-15 

Transmembrane domain, 
amino acids 202-222 

N-glycosylation sites. 

amino acids 59-62, 79-82, 123-126, 140-143, 258-261, 282-285, 424-427 

cAMP- and cGMP-dependent protein kinase phosphorylation site, 
amino acids 253-256 

Tyrosine kinase phosphorylation site, 
amino acids 500-506 

N-myristoylation sites. 

amino acids 27-32, 60-65, 77-82, 183-188, 220-225, 386-391, 427-432 
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FIGURE 106 



MSNKFLGTWKLVSSENFDDYMKALGVGLATRKLGNLAKPTVIISKKGDI ITIRTESTFKNTEISFKLGQEFEETTAD 
NRKTKSIVTLQRGSLNQVQRWDGKETTIKRKLVNGKMVAECKMKGVVCTRIYEKV 

N-myristoylation sites, 
amino acids 25-30, 122-127 

Lipocalin/cytosolic fatty-acid binding protein homology, 
amino acids 4-132 
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FIGURE 107 

MGALARALLLPLLAQWLLRAAPELAPAPFTLPLRVAAATNRVVAPTPGPGTPAERHADGLALALEPALASPAGAANF 
LAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSW 
TGFVGEDLVTIPKGFNTSFLVNIATIFESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQ 
MCGAGLPVAGSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTTL 
LRLPQKVFDAVVEAVARASLI PEFSDGFWTGSQLACWTNSETPWSYFPKISIYLRDENSSRSFRITILPQLYIQPMM 
GAGLNYECYRFGISPSTNALVIGATVMEGFYVIFDRAQKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQS 
LSEPILWIVSYALMSVCGAILLVLIVLLLLPFRCQRRPRDPEVVNDESSLVRHRWK 

Signal sequence, 
amino acids 1-20 

Transmembrane domain, 
amino acids 471-491 

N-glycosylation sites, 
amino acids 170-173, 366-369 

Glycosaminoglycan attachment site, 
amino acids 88-91 

N-myristoylation sites. 

amino acids 59-64, 120-125, 152-157, 190-195, 198-203, 236-241, 241-246, 243- 
248, 253-258, 285-290, 339-344, 386-391, 397-396, 438-443 

Leucine zipper patterns, 
amino acids 10-31, 197-218 

Eukaryotic aspartyl protease homology, 
amino acids 41-431 
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FIGURE 108 

^4AVKLGTLLLALALGLAQPASARRKLLVFLLDGFRSDYISDEALESLPGFKEIVSRGVKVDYLTPDFPSLSyPNYyT 
LMTGRHCEVHQMIGNYMWDPTTNKSFDIGVNKDSLMPLWWNGSEPLWVTLTKAKRKVYMYYWPGCEVEILGVRPTYC 
LEYKNVPTDINFANAVSDALDSFKSGRADLAAIYHERIDVEGHHYGPASPQRKDALKAVDTVLKYMTKWIQERGLQD 
RLNVIIFSDHGMTDIFWMDPCVIELNKYISLNDLQQVKDRGPVVSLWPAPGKHSEI YNKLSTVEHMTVYEKEAIPSRF 
YYKKGKFVSPLTLVADEGWFITENREMLPFWMNSTGRREGWQRGWHGYDNELMDMRGIFLAFGPDFKSNFRAAPIRS 
VDVYNVMCNWGITPLPNNGSWSRVMCMLKGRAGTAPPVWPSHCALALILLFLLA 

Signal sequence, 
amino acids 1-22 

Transmembrane domain, 
amino acids 423-439 

N-glycosylation sites. 

amino acids 100-103, 118-121, 341-344, 404-407 

Tyrosine kinase phosphorylation sites, 
amino acids 191-199, 251-258 

N-myristoylation sites, 
amino acids 148-153, 365-370 

Amidation site, 
amino acids 343-346 

Phosphodiest Type I phosphodiesterase, 
amino acids 5-381 
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FIGURE 109 

MKFLAVLVLLGVSIFLVSAQNPTTAAPADTYPATGPADDEAPDAETTAAATTATTAAPTTATTAASTTARKDIPVLP 
KWVGDLPNGRVCP 

Signal sequence, 
amino acids 1-19 
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FIGURE 110 

MFQTGGLIVFYGLLAQTMAQFGGLPVPLDQTLPLNVNPALPLSPTGLAGSLTNALSNGLLSGGLLGILENLPLLDIL 
KPGGGTSGGLLGGLLGKVTSVIPGLNNIIDIKVTDPQLLELGLVQSPDGHRLYVTIPLGIKLQVNTPLVGASLLRLA 
VKLDITAEILAVRDKQERIHLVLGDCTHSPGSLQISLLDGLGPLPIQGLLDSLTGILNKVLPELVQGNVCPLVNEVL 
RGLDITLVHDIVNMLIHGLQFVIKV 

Signal sequence, 
amino acids 1-19 

Transmembrane domain, 
amino acids 134-154 

N-myristoylation sites. 

amino acids 46-51, 49-54, 58-63, 62-67, 66-71, 80-85, 81-86, 82-87, 85-90, 86- 
91, 89-94, 202-207, 233-238 
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FIGURE 111 

MEPPYSLTAHYDEFQEVKYVSRCGAGGARGASLPPGFPLGAARSVTGARSGLPRWNRREVCLLSGLVFAAGLCAILA 
AMLALKYLGPVAAGGGACPEGCPERKAFARAARFLAANLDASIDPCQDFYSFACGGWLRRHAIPDDKLTYGTIAAIG 
EQNEERLRRLLARPGGGPGGAAQRKVRAFFRSCLDMREIERLGPRPMLEVIEDCGGWDLGGAEERPGVAARWDLNRL 
LYKAQGVYSAAALFSLTVSLDDRNSSRYVIRIDQDGLTLPERTLYLAQDEDSEKILAAYRVFMERVLSLLGADAVEQ 
KAQEILQVEQQLANITVSEYDDLRRDVSSMYNKVTLGQLQKITPHLRWKWLLDQIFQEDFSEEEEVVLLATDYMQQV 
SQLIRSTPHRVLHNYLVWRVVVVLSEHLSPPFREALHELAQEMEGSDKPQELARVCLGQANRHFGMALGALFVHEHF 
SAASKAKVQQLVEDIKYILGQRLEELDWMDAETRAAARAKLQYMMVMVGYPDFLLKPDAVDKEYEFEVHEKTYFKNI 
LNSIPFSIQLSVKKIRQEVDKSTWLLPPQALNAYYLPNKNQMVFPAGILQPTLYDPDFPQSLNYGGIGTIIGHELTH 
GYDDWGGQYDRSGNLLHWWTEASYSRFLRKAECIVRLYDNFTVYNQRVNGKHTLGENIADMGVLKLAYHAYQKWVRE 
HGPEHPLPRLKYTHDQLFFIAFAQNWCIKRRSQSIYLQVLTDKHAPEHYRVLGSVSQFEEFGRAFHCPKDSPMNPAH 
KCSVW 

Transmembrane domain, 
amino acids 64-84 

N-glycosylation sites. 

amino acids 255-258, 322-325, 656-659 

cAMP- and cGMP-dependent protein kinase phosphorylation site, 
amino acids 722-725 

N-myristoylation sites. 

amino acids 24-29, 26-31, 27-32, 40-45, 47-52, 65-70, 148-153, 169-174, 170-175, 
237-242, 450-455, 604-609, 607-612 

Prenyl group binding site (CAAX box) . 
amino acids 772-777 

Neutral zinc metallopeptidases, zinc-binding region signature, 
amino acids 609-618 

Peptidase family M13. 
amino acids 571-774 
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FIGURE 112 

MGPSCPVFLSFTKLSLWWLLLTPAGGEEAKRPPPBAPGDPLSSPSPTALPQGGSHTETEDRLFKHLFRGYNRWARPV 
PNTSDWIVRFGLSIAQLI DVDEKNQMMTTNVWLKQEWSDYKLRWNPADFGNITSLRVPSEMIWIPDIVLYNNADGE 
FAVTHMTKAHLFSTGTVHWVPPAIYKSSCSIDVTFFPFDQQNCKMKFGSWTYDKAKIDLEQMEQTVDLKDYWESGEW 
AIVNATGTYNSKKYDCCAEIYPDVTYAFVIRRLPLFYTINLII PCLLISCLTVLVFYLPSDCGEKITLCISVLLSLT 
VFLLLITEIIPSTSLVIPLIGEYLLFTMIFVTLSIVITVFVLNVHHRSPSTHTMPHWVRGALLGCVPRWLLMNRPPP 
PVELCHPLRLKLSPSYHWLESNVDAEEREVVVEEEDRWACAGHVAPSVGTLCSHGHLHSGASGPKAEALLQEGELLL 
SPHMQKALEGVHYIADHLRSEDADSSVKEDWKYVAMVIDRIFLWLFIIVCFLGTIGLFLPPFLAGMI 

Transmembrane domains. 

amino acids 2-22, 264-284, 299-319, 330-350, 501-521 

N-glycosylation sites. 

amino acids 79-82, 129-132, 235-238 

Tyrosine kinase phosphorylation site, 
amino acids 468-475 

N-myristoylation sites. 

amino acids 25-30, 52-57, 89-94, 128-133, 238-243, 368-373, 434-439 

Neurotransmitter-gated ion-channels signature, 
amino acids 183-197 

Neurotransmitter-gated ion-channel ligand binding domain, 
amino acids 59-265 

Neurotransmitter-gated ion-channel transmembrane domain, 
amino acids 272-520 
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FIGURE 113 

MLPPGTATLLTLLLAAGSLGQKPQRPRRPASPISTIQPKANFDAQQFAGTWLLVAVGSACRFLQEQGHRAEATTLHV 
APQGTAMAVSTFRKLDGICWQVRQLYGDTGVLGRFLLQARGARGAVNVVVAETDYQSFAVLYLERAGQLSVKLYARS 
LPVSDSVLSGFEQRVQEAHLTEDQIFYFPKYGFCEAADQFHVLDEVRR 

Signal sequence, 
amino acids 1-20 

N-myristoylation sites, 
amino acids 81-86, 118-123 

Lipocalin/cytosolic fatty-acid binding protein, 
amino acids 46-188 
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FIGURE 114 

MRLTVLCAVCLLPGSLALPLPQEAGGMSELQWEQAQDYLKRFYLYDSETKNANSLEAKLKEMQKFFGLPITGMLNSR 
VIEIMQKPRCGVPDVAEYSLFPNSPKWTSKVVTYRIVSYTRDLPHITVDRLVSKALNMWGKEIPLHFRKVVWGTADI 
MIGFARGAHGDSYPFDGPGNTLAHAFAPGTGLGGDAHFDEDERWTDGSSLGINFLYAATHELGHSLGMGHSSDPNAV 
MYPTYGNGDPQNFKLSQDDIKGIQKLYGKRSNSRKK 

Signal sequence, 
amino acids 1-17 

N-myristoylation sites. 

amino acids 67-72, 72-77, 173-178, 183-188, 201-206, 221-226 

Amidation site, 
amino acids 258-261 

Neutral zinc metallopeptidases , zinc-binding region signature, 
amino acids 211-220 

Matrixins cysteine switch, 
amino acids 85-92 

Matrixin . 

amino acids 38-199 
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FIGURE 115 

MAPPQVLAFGLLLAAATATFAAAQEECVCENYKLAVNCFVNNNRQCQCTSVGAQNTVICSKLAAKCLVMKAEMNGSK 
LGRRAKPEGALQNNDGLYDPDCDESGLFKAKQCNGTSTCWCVNTAGVRRTDKDTEITCSERVRTYWIIIELKHKARE 
KPYDSKSLRTALQKEITTRYQLDPKFITSILYENNVITIDLVQNSSQKTQNDVDIADVAYYFEKDVKGESLFHSKKM 
DLTVNGEQLDLDPGQTLIYYVDEKAPEFSMQGLKAGVIAVI VVVVMAVVAGIVVLVISRKKRMAKYEKAEIKEMGEM 
HRELNA 

Signal sequence, 
amino acids 1-23 

Transmembrane domain, 
amino acids 265-285 

N-glycosylation sites. 

amino acids 74-77, 111-114, 198-201 

Tyrosine kinase phosphorylation site, 
amino acids 151-157 

N-myristoylation sites. 

amino acids 10-15, 52-57, 75-80, 86-91, 103-108, 112-117, 123-128, 263-268 

Amidation site, 
amino acids 78-81 

Thyroglobulin type-1 repeat, 
amino acids 66-135 
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FIGURE 116 

MRQKAVSVFLCYLLLFTCSGVEAGKKKCSESSDSGSGFWKALTFMAVGGGLAVAGLPALGFTGAGIAANSVAASLMS 
WSAILNGGGVPAGGLVATLQSLGAGGSSVVIGNIGALMRYATHKYLDSEEDEE 

Signal sequence, 
amino acids 1-20 

Transmembrane domains, 
amino acids 39-59, 90-110 

Glycosaminoglycan attachment site, 
amino acids 34-37 

cAMP- and cGMP-dependent protein kinase phosphorylation site, 
amino acids 26-29 

N-myristoylation sites. 

amino acids 20-25, 48-53, 50-55, 63-68, 65-70, 85-90, 86-91, 90-95, 91-96, 100- 
105, 109-114 



Amidation site, 
amino acids 23-26 
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FIGURE 117 

MPRAPAPLYACLLGLCALLPRLAGLNICTSGSATSCEECLLIHPKCAWCSKEDFGSPRSITSRCDLRANLVKNGCGG 
EIESPASSFHVLRSLPLSSKGSGSAGWDVIQMTPQEIAVNLRPGDKTTFQLQVRQVEDYPVDLYYLMDLSLSMKDDL 
DNIRSLGTKLAEEMRKLTSNFRLGFGSFVDKDISPFSYTAPRYQTNPCIGYKLFPNCVPSFGFRHLLPLTDRVDSFN 
EEVRKQRVSRNRDAPEGGFDAVLQAAVCKEKIGWRKDALHLLVFTTDDVPHIALDGKLGGLVQPHDGQCHLNEANEY 
TASNQMDYPSLALLGEKLAENNINLIFAVTKNHYMLYKNFTALIPGTTVEILDGDSKNIIQLIINAYNSIRSKVELS 
VWDQPEDLNLFFTATCQDGVSYPGQRKCEGLKIGDTASFEVSLEARSCPSRHTEHVFALRPVGFRDSLEVGVTYNCT 
CGCSVGLEPNSARCNGSGTYVCGLCECSPGYLGTRCECQDGENQSVYQNLCREAEGKPLCSGRGDCSCNQCSCFESE 
FGKIYGPFCECDNFSCARNKGVLCSGHGECHCGECKCHAGYIGDNCNCSTDISTCRGRDGQICSERGHCLCGQCQCT 
EPGAFGEMCEKCPTCPDACSTKRDCVECLLLHSGKPDNQTCHSLCRDEVITWVDTIVKDDQEAVLCFYKTAKDCVMM 
FTYVELPSGKSNLTVLREPECGNTPNAMTILLAVVGSILLVGLALLAIWKLLVTIHDRREFAKFQSERSRARYEMAS 
NPLYRKPISTHTVDFTFNKFNKSYNGTVD 

Signal sequence, 
amino acids 1-24 
Transmembrane domain, 
amino acids 723-743 
N-glycosylation sites. 

amino acids 347-350, 460-463, 477-480, 505-508, 552-555, 586-589, 654-657, 705- 
708, 791-794, 795-798 
Glycosaminoglycan attachment sites, 
amino acids 523-526, 564-567 

cAMP- and cGMP-dependent protein kinase phosphorylation site, 
amino acids 169-172 
N-myristoylation sites. 

amino acids 24-29, 31-36, 55-60, 98-103, 161-166, 248-253, 409-414, 415-420, 
456-461, 464-469, 468-473, 480-485, 485-490, 560-565, 599-604, 611-616, 715-720 
Cell attachment sequence, 
amino acids 525-527 

MotEGF-like domain cysteine pattern signatures, 
amino acids 487-498, 574-585 

Integrins beta chain cysteine-rich domain signatures, 
amino acids 522-535, 563-576 
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FIGURE 118 

MLPQIPFLLLVSLNLVHGVFYAERYQMPTGIKGPLPNTKTQFFIPYTIKSKGIAVRGEQGTPGPPGPAGPRGHPGPS 
GPPGKPGYGSPGLQGEPGLPGPPGPSAVGKPGVPGLPGKPGERGPYGPKGDVGPAGLPGPRGPPGPPGIPGPAGISV 
PGKPGQQGPTGAPGPRGFPGEKGAPGVPGMNGQKGEMGYGAPGRPGERGLPGPQGPTGPSGPPGVGKRGENGVPGQP 
GIKGDRGFPGEMGPIGPPGPQGPPGERGPEGIGKPGAAGAPGQPGIPGTKGLPGAPGIAGPPGPPGFGKPGLPGLKG 
ERGPAGLPGGPGAKGEQGPAGLPGKPGLTGPPGNMGPQGPKGIPGSHGLPGPKGETGPAGPAGYPGAKGERGSPGSD 
GKPGYPGKPGLDGPKGNPGLPGPKGDPGVGGPPGLPGPVGPAGAKGMPGHNGEAGPRGAPGIPGTRGPIGPPGIPGF 
PGSKGDPGSPGPPGPAGIATKGLNGPTGPPGPPGPRGHSGEPGLPGPPGPPGPPGQAVMPEGFIKAGQRPSLSGTPL 
VSANQGVTGMPVSAFTVILSKAYPAIGTPIPFDKILYNRQQHYDPRTGIFTCQIPGIYYFSYHVHVKGTHVWVGLYK 
NGTPVMYTYDEYTKGYLDQASGSAIIDLTENDQVWLQLPNAESNGLYSSEYVHSSFSGFLVAPM 

Signal sequence, 
amino acids 1-18 

Tyrosine kinase phosphorylation site, 
amino acids 116-123 

N-myristoylation sites. 

amino acids 18-23, 276-281, 317-322, 350-355, 380-385, 415-420, 446-451, 
529-534, 548-553, 587-592, 613-618, 661-666 

Amidation site, 
amino acids 219-224 

Clq domain signature, 
amino acids 571-601 

Clq domain. 

amino acids 553-677 

Collagen triple helix repeat (20 copies) . 

amino acids 92-150, 156-214, 223-281, 282-340, 344-403, 404-463, 464-522 
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FIGURE 119 

MLLWSLLVIFDAVTEQADSLTLVAPSSVFEGDSIVLKCQGEQNWKIQKMAYHKDNKELSVFKKFSDFLIQSAVLSDS 
GNYFCSTKGQLFLWDKTSNIVKIKVQELFQRPVLTASSFQPIEGGPVSLKCETRLSPQRLDVQLQFCFFRENQVLGS 
GWSSSPELQISAVWSEDTGSYWCKAETVTHRIRKQSLQSQIHVQRIPISNVSLEIRAPGGQVTEGQKLILLCSVAGG 
TGNVTFSWYREATGTSMGKKTQRSLSAELEIPAVKESDAGKYYCRADNGHVPIQSKVVNIPVRIPVSRPVLTLRSPG 
AQAAVGDLLELHCEALRGSPPILYQFYHEDVTLGNSSAPSGGGASFNLSLTAEHSGNYSCEANNGLGAQCSEAVPVS 
ISGPDGYRRDLMTAGVLWGLFGVLGFTGVALLLYALFHKISGESSATNEPRGASRPNPQEFTYSSPTPDMEELQPVY 
VNVGSVDVDWYSQVWSMQQPESSANIRTLLENKDSQVIYSSVKKS 

Signal sequence, 
amino acids 1-14 

Transmembrane domain, 
amino acids 400-420 

N-glycosylation sites. 

amino acids 204-207, 234-237, 343-346, 355-358, 365-368 

Glycosaminoglycan attachment site, 
amino acids 348-351 

cAMP- and cGMP-dependent protein kinase phosphorylation sites, 
amino acids 62-65, 187-190 

Tyrosine kinase phosphorylation site, 
amino acids 266-273 

N-myristoylation sites. 

amino acids 78-83, 121-126, 153-158, 173-178, 213-218, 230-235, 245-250, 308- 
313, 349-354, 351-356, 364-369, 375-380, 400-405 

Amidation site, 
amino acids 248-251 

Immunoglobulin domains. 

amino acids 17-84, 121-179, 219-277, 314-370 
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FIGURE 120 

LRQGLSGGGGSLASGGPGPGHASLSQRLRLYLADSWNQCDLVALTCFLLGVGCRLTPGLYHLGRTVLCIDFMVFTVR 
LLHIFTVNKQLGPKIVIVSKMMKDVFFFLFFLGVWLVAYGVATEGLLRPRDSDFPSILRRVFYRPYLQIFGQIPQED 
MDVALMEHSNCSSEPGFWAHPPGAQAGTCVSQYANWLVVLLLVIFLLVANILLVNLLIAMFSYTFGKVQGNSDLYWK 
AQRYRLIREFHSRPALAPPFIVISHLRLLLRQLCRRPRSPQPSSPALEHFRVYLSKEAERKLLTWESVHKENFLLAR 
ARDKRESDSERLKRTSQKVDLALKQLGHIREYEQRLKVLEREVQQCSRVLGWVAEALSRSALLPPGGPPPPDLPGSKD 

Transmembrane domains . 

amino acids 34-54, 63-83, 99-119, 189-209, 238-258 

N-glycosylation site, 
amino acids 164-167 

Glycosaminoglycan attachment site, 
amino acids 6-9 

cAMP- and cGMP-dependent protein kinase phosphorylation sites, 
amino acids 312-315, 321-324 

N-myristoylation sites. 

amino acids 4-9, 7-12, 9-14, 10-15, 177-182, 181-186 

Leucine zipper pattern, 
amino acids 41-62 
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FIGURE 121 

MGPSCPVFLSFTKLSLWWLLLTPAGGEEAKRPPPRAPGDPLSSPSPTALPQGGSHTETEDRLFKHLFRGYNRWARPV 
PNTSDVVIVRFGLSIAQLIDVDEKNQMMTTNVWLKQEWSDYKLRWNPADFGNITSLRVPSEMIWIPDIVLYNNADGE 
FAVTHMTKAHLFSTGTVHWVPPAIYKSSCSIDVTFFPFDQQNCKMKFGSWTYDPCAKIDLEQMEQTVDLKDYWESGEW 
AIVNATGTYNSKKYDCCAEIYPDVTYAFVIRRLPLFYTINLIIPCLLISCLTVLVFYLPSDCGEKITLCISVLLSLT 
VFLLLITEIIPSTSLVIPLIGEYLLFTMIFVTLSIVITVFVLNVHHRSPSTHTMPHWVRGALLGCVPRWLLMNRPPP 
PVELCHPLRLKLSPSYHWLESNVDAEEREVVVEEEDRWACAGHVAPSVGTLCSHGHLHSGASGPKAEALLQEGELLL 
SPHMQKALEGVHYIADHLRSEDADSSVKEDWKYVAMVIDRIFLWLFI IVCFLGTIGLFLPPFLAGMI 

Signal sequence, 
amino acids 1-2 6 

Transmembrane domains. 

amino acids 264-284, 299-319, 330-350, 501-521 

N-glycosylation sites. 

amino acids 79-82, 129-132, 235-238 

Tyrosine kinase phosphorylation site, 
amino acids 468-475 

N-myristoylation sites. 

amino acids 25-30, 52-57, 89-94, 128-133, 238-243, 368-373, 434-439 

Neurotransmitter-gated ion-channel signature, 
amino acids 183-197 

Neurotransmitter-gated ion-channel ligand binding domain, 
amino acids 59-265 

Neurotransmitter-gated ion-channel transmembrane region, 
amino acids 272-520 
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FIGURE 122 

MESRKDITNQEELWKMKPRRNLEEDDYLHKDTGETSMLKRPVLLHLHQTAHADEFDCPSELQHTQELFPQWHLPIKI 
AAIIASLTFLYTLLREVIHPLATSHQQYFYKIPILVINKVLPMVSITLLALVYLPGVIAAIVQLHNGTKYKKFPHWL 
DKWMLTRKQFGLLSFFFAVLHAIYSLSYPMRRSYRYKLLNWAYQQVQQNKEDAWIEHDVWRMEIYVSLGIVGLAILA 
LLAVTSIPSVSDSLTWREFHYIQSKLGIVSLLLGTIHALIFAWNKWIDIKQFVWYTPPTFMIAVFLPIVVLIFKSIL 
FLPCLRKKILKIRHGWEDVTKINKTEICSQL 

Transmembrane domains . 

amino acids 72-92, 116-136, 158-178, 218-238, 254-274, 291-311 

N-glycosylation sites, 
amino acids 143-146, 331-334 

Tyrosine kinase phosphorylation site, 
amino acids 19-27 

N-myristoylation sites, 
amino acids 133-138, 265-270 
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FIGURE 123 

MHNTTEKPTDAYGELDFTGAGRKHSNFLRLSDRTDPAAVYSLVTRTWGFRAPNLVVSVLGGSGGPVLQTWLQDLLRR 
GLVRAAQSTGAWIVTGGLHTGIGRHVGVAVRDHQMASTGGTKVVAMGVAPWGVVRNRDTLINPKGSFPARYRWRGDP 
EDGVQFPLDYNYSAFFLVDDGTHGCLGGENRFRLRLESYISQQKTGVGGTGIDIPVLLLLIDGDEKMLTRIENATQA 
QLPCLLVAGSGGAADCLAETLEDTLAPGSGGARQGEARDRIRRFFPKGDLEVLQAQVERIMTRKELLTVYSSEDGSE 
EFETIVLKALVKACGSSEASAYLDELRLAVAWNRVDIAQSELFRGDIQWRSFHLEASLMDALLNDRPEFVRLLISHG 
LSLGHFLTPMRLAQLYSAAPSNSLIRNLLDQASHSAGTKAPALKGGAAELRPPDVGHVLRMLLGKMCAPRYPSGGAW 
DPHPGQGFGESMYLLSDKATSPLSLDAGLGQAPWSDLLLWALLLNRAQMAMYFWEMGSNAVSSALGACLLLRVMARL 
EPDAEEAARRKDLAFKFEGMGVDLFGECYRSSEVRAARLLLRRCPLWGDATCLQLAMQADARAFFAQDGVQSLLTQK 
WWGDMASTTPIWALVLAFFCPPLI YTRLITFRKSEEEPTREELEFDMDSVINGEGPVGTADPAEKTPLGVPRQSGRP 
GCCGGRCGGRRCLRRWFHFWGAPVTIFMGNVVSYLLFLLLFSRVLLVDFQPAPPGSLELLLYFWAFTLLCEELRQGL 
SGGGGSLASGGPGPGHASLSQRLRLYLADSWNQCDLVALTCFLLGVGCRLTPGLYHLGRTVLCIDFMVFTVRLLHIF 
TVNKQLGPKIVIVSKMMKDVFFFLFFLGVWLVAYGVATEGLLRPRDSDFPSILRRVFYRPYLQIFGQIPQEDMDVAL 
MEHSNCSSEPGFWAHPPGAQAGTCVSQYANWLVVLLLVIFLLVANILLVNLLIAMFSYTFGKVQGNSDLYWKAQRYR 
LIREFHSRPALAPPFIVISHLRLLLRQLCRRPRSPQPSSPALEHFRVYLSKEAERKLLTWESVHKENFLLARARDKR 
ES DSERLERT SQKVDLALKQLGH I RE YEQRLKVLEREVQQCS RVLGWVT 

Transmembrane domains. 

amino acids 621-641, 713-733, 798-818, 828-848, 864-884, 954-974, 1003-1023 
N-glycosylation sites. 

amino acids 3-6, 165-168, 227-230, 929-932 
Glycosaminoglycan attachment site, 
amino acids 771-774 

cAMP- and cGMP-dependent protein kinase phosphorylation sites, 
amino acids 22-25, 1077-1080 
N-myristoylation sites. 

amino acids 78-83, 93-98, 94-99, 129-134, 142-147, 175-180, 178-183, 200-205, 

240-245, 243-248, 259-264, 262-267, 323-328, 385-390, 467-472, 694-699, 697-702, 

701-706, 714-719, 722-727, 769-774, 772-777, 774-779, 775-780, 942-947, 946-951 

Amidation sites . 

amino acids 20-23, 701-704 

Cell attachment sequences. 

amino acids 151-153, 352-354 

Leucine zipper pattern. 

amino acids 806-827 
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FIGURE 124 



MHLHRRFTDLIWKNLCPALIVILGNPIHDKTITSAHTSSTSTSLESDSASPGVSDHGRGSGCSCTAPALSGPVARTI 
YYIAAELVRLVGSVDSMKPVLQSLYHRVLLYPPPQHRVEAIKIMKEILGSPQRLCDLAGPSSTESESRKRSISKRKS 
HLDLLKLIMDGMTEACIKGGIEACYAAVSCVCTLLGALDELSQGKGLSEGQVQLLLLRLEELKDGAEWSRDSMEINE 
ADFRWQRRVLSSEHTPWESGNERSLDISISVTTDTGQTTLEGELGQTTPEDHSGNHKNSLKSPAIPEGKETLSKVLE 
TEAVDQPDVVQRSHTVPYPDITNFLSVDCRTRSYGSRYSESNFSVDDQDLSRTEFDSCDQYSMAAEKDSGRSDVSDI 
GSDNCSLADEEQTPRDCLGHRSLRTAALSLKLLKNQEADQHSARLFIQSLEGLLPRLLSLSNVEEVDTALQNFASTF 
CSGMMHSPGFDGNSSLSFQMLMNADSLYTAAHCALLLNLKLSHGDYYRKRPTLAPGVMKDFMKQVQTSGVLMVFSQA 
WIEELYHQVLDRNMLGEAGYWGSPEDNSLPLITMLTDIDGLESSAIGGQLiVIASAATESPFAQSRRIDDSTVAGVAFA 
RYILVGCWKNLIDTLSTPLTGRMAGSSKGLAFILGAEGIKEQNQKERDAICMSLDGLRKAARLSCALGVAANCASAL 
AQMAAASCVQEEKEEREAQEPSDAITQVKLKVEQKLEQIGKVQGVWLHTAHVLCMEAILSVGLEMGSHNPDCWPHVF 
RVCEYVGTLEHNHFSDGASQPPLTISQPQKATGSAGLLGDPECEGSPPEHSPEQGRSLSTAPVVQPLSIQDLVREGS 
RGRASDFRGGSLMSGSSAAKVVLTLSTQADRLFEDATDKLNLMALGGFLYQLKKASQSQLFHSVTDTVDYSLAMPGE 
VKSTQDRKSALHLFRLGNAMLRIVRSKARPLLHVMRCWSLVAPHLVEAACHKERHVSQKAVSFIHDILTEVLTDWNE 
PPHFHFNEALFRPFERIMQLELCDEDVQDQVVTSIGELVEVCSTQIQSGWRPLFSALETVHGGNKSEMKEYLVGDYS 
MGKGQAPVFDVFEAFLNTDNIQVFANAATSYIMCLMKFVKGLGEVDCKEIGDCAPAPGAPSTDLCLPALDYLRRCSQ 
LLAKIYKMPLKPIFLSGRLAGLPRRLQEQSASSEDGIESVLSDFDDDTGLIEVWIILLEQLTAAVSNCPRQHQPPTL 
DLLFELLRDVTKTPGPGFGIYAVVHLLLPVMSVWLRRSHKDHSYWDMASANFKHAIGLSCELVVEHIQSFLHSDIRY 
ESMINTMLKDLFELLVACVAKPTETISRVGCSCIRYVLVTAGPVFTEEMWRLACCALQDAFSATLKPVKDLLGCFHS 
GTESFSGEGCQVRVAAPSSSPSAEAEYWRIRAMAQQVFMLDTQCSPKTPNNFDHAQSCQLIIELPPDEKPNGHTKKS 
VSFREI VVSLLSHQVLLQNLYDILLEEFVKGPSPGEEKTIQVPEAKLAGFLRYISMQNLAVIFDLLLDSYRTAREFD 
TSPGLKCLLKKVSGIGGAANLYRQSAMSFNI YFHALVCAVLTNQETITAEQVKKVLFEDDERSTDSSQQCSSEDEDI 
FEETAQVSPPRGKEKRQWRARMPLLSVQPVSNADWVWLVKRLHKLCMELCNNYIQMHLDLENCMEEPPIFKGDPFFI 
LPSFQSESSTPSTGGFSGKETPSEDDRSQSREHMGESLSLKAGGGDLLLPPSPKVEKKDPSRKKEWWENAGNKIYTM 
AADKTISKLMTEYKKRKQQHNLSAFPKEVKVEKKGEPLGPRGQDSPLLQRPQHLMDQGQMRHSFSAGPELLRQDKRP 
RSGSTGSSLSVSVRDAEAQIQAWTNMVLTVLNQIQILPDQTFTALQPAVFPCISQLTCHVTDIRVRQAVREWLGRVG 
RVYDIIV 



Transmembrane domains. 

amino acids 482-502, 1243-1263, 1510-1530 
N-glycosylation sites. 

amino acids 350-353, 389-392, 475-478, 1065-1068, 1792-1795 
Glycosaminoglycan attachment sites, 
amino acids 1392-1395, 1553-1556 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 5-8, 145-148, 151-154, 511-514, 900-903, 1151-1154, 1550-1153 

Tyrosine kinase phosphorylation site. 

amino acids 172-179, 1269-1276 

N-myristoylation sites. 

amino acids 59-64, 61-66, 89-94, 165-170, 173-178, 174-179, 200-205, 219-224, 
251-256, 285-290, 343-348, 386-391, 579-584, 587-592, 612-617, 622-627, 641-646, 
672-677, 684-689, 755-760, 857-862, 862-867, 1063-1068, 1136-1141, 1382-1387, 
1554-1559, 1556-1561, 1708-1713, 1851-1856, 1854-1859 
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FIGURE 125 

MVDPNGNESSATYFILIGLPGLEEAQFWLAFPLCSLYLIAVLGNLTI IYI VRTEHSLHEPMYIFLCMLSGIDILIST 
SSMPKMLAIFWFNSTTIQFDACLLQIFAIHSLSGMESTVLLAMAFDRYVAICHPLRHATVLTLPRVTKIGVAAVVRG 
AALMAPLPVFIKQLPFCRSNILSHSYCLHQDVMKLACDDIRVNVVYGLIVIISAIGLDSLLISFSYLLILKTVLGLT 
REAQAKAFGTCVSHVCAVFIFYVPFIGLSMVHRFSKRRDSPLPVILANI YLLVPPVLNPIVYGVKTKEIRQRILRLF 
HVATHASEP 

Transmembrane domains. 

amino acids 27-47, 61-81, 99-119, 141-161, 205-225, 239-259, 273-293 

N-glycosylation sites. 

amino acids 7-10, 44-47, 90-93 

cAMP- and cGMP-dependent protein kinase phosphorylation site, 
amino acids 268-271 

N-myristoylation sites. 

amino acids 6-11, 21-26, 111-116, 240-245 



7 transmembrane receptor 
amino acids 43-293 



(rhodopsin family) . 
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FIGURE 126 

MGSNSGQAGRHIYKSLADDGPFDSVEPPKRPTSRLIMHSMAMFGREFCYAVEAAYVTPVLLSVGLPSSLYSIVWFLS 
PILGFLLQPVVGSASDHCRSRWGRRRPYILTLGVMMLVGMALYLNGATVVAALIANPRRKLVWAISVTMIGVVLFDF 
AADFIDGPIKAYLFDVCSHQDKEKGLHYHALFTGFGGALGYLLGAIDWAHLELGRLLGTEFQVMFFFSALVLTLCFT 
VHLCSISEAPLTEVAKGIPPQQTPQDPPLSSDGMYEYGSIEKVKNGYVNPELAMQGAKNKNHAEQTRRAMTLKSLLR 
ALVNMPPHYRYLCISHLIGWTAFLSNMLFFTDFMGQIVYRGDPYSAHNSTEFLIYERGVEVGCWGFCINSVFSSLYS 
YFQKVLVSYIGLKGLYFTGYLLFGLGTGFIGLFPNVYSTLVLCSLFGVMSSTLYTVPFNLITEYHREEEKERQQAPG 
GDPDNSVRGKGMDCATLTCMVQLAQILVGGGLGFLVNTAGTVVVVVITASAVALIGCCFVALFVRYVD 

Transmembrane domains . 

amino acids 68-88, 105-125, 141-161, 182-202, 216-236, 318-338, 394-414, 422- 
442, 501-521 

N-glycosylation site, 
amino acids 356-359 

cAMP- and cGMP-dependent protein kinase phosphorylation site, 
amino acids 29-32 

N-myristoylation sites. 

amino acids 2-7, 64-69, 190-195, 366-371, 399-404, 409-414, 416-421, 432-437, 
473-478, 491-496 

Amidation site, 
amino acids 99-102 

Cell attachment sequence, 
amino acids 348-350 
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FIGURE 127 

MRLIRNIYIFCSIVMTAEGDAPELPEERELMTNCSNMSLRKVPADLTPATTTLDLSYNLLFQLQSSDFHSVSKLRVL 
ILCHNRIQQLDLKTFEFNKELRYLDLSNNRLKSVTWYLLAGLRYLDLSFNDFDTMPICEEAGNMSHLEILGLSGAKI 
QKSDFQKIAHLHLNTVFLGFRTLPHYEEGSLPILNTTKLHIVLPMDTNFWVLLRDGIKTSKILEMTNIDGKSQFVSY 
EMQRNLSLENAKTSVLLLNKVDLLWDDLFLILQFVWHTSVEHFQIRNVTFGGKAYLDHNSFDYSNTVMRTIKLEHVH 
FRVFYIQQDKIYLLLTKMDIENLTISNAQMPHMLFPNYPTKFQYLNFANNILTDELFKRTIQLPHLKTLILNGNKLE 
TLSLVSCFANNTPLEHLDLSQNLLQHKNDENCSWPETVVNMNLSYNKLSDSVFRCLPKSIQILDLNNNQIQTVPKET 
IHLMALRELNIAFNFLTDLPGCSHFSRLSVLNIEMNFILSPSLDFVQSCQEVKTLNAGRNPFRCTCELKNFIQLETY 
SEVMMVGWSDSYTCEYPLNLRGTRLKDVHLHELSCNTALLIVTIVVIMLVLGLAVAFCCLHFDLPWYLRMLGQCTQT 
WHRVRKTTQEQLKRNVRFHAFISYSEHDSLWVKNELIPNLEKEDGSILICLYESYFDPGKSISENIVSFIEKSYKSI 
FVLSPNFVQNEWCHYEFYFAHHNLFHENSDHIILILLEPIPFYCIPTRYHKLPCALLEKKAYLEWPKDRRKCGLFWAN 
LRAAINVNVLATREMYELQTFTELNEESRGSTISLMRTDCL 

Transmembrane domain, 
amino acids 577-597 
N-glycosylation sites. 

amino acids 33-36, 36-39, 140-143, 189-192, 236-239, 278-281, 330-333, 416-419, 
427-430 

cAMP- and cGMP-dependent protein kinase phosphorylation site, 
amino acids 621-624 

Tyrosine kinase phosphorylation site, 
amino acids 74 6-754 
N-myristoylation sites. 

amino acids 148-153, 210-215, 591-596, 765-770, 800-805 

Leucine zipper patterns. 

amino acids 39-60, 101-122 

Nt-dnaJ domain signature. 

amino acids 350-369 

TIR domain. 

amino acids 636-774 

Leucine rich repeats. 

amino acids 49-72, 73-96, 97-120, 143-166, 373-394, 398-418, 422-442, 444-466, 
467-488, 489-512 

Leucine rich repeat C-terminal domain, 
amino acids 522-575 
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FIGURE 128 



MTRKRTYWVPNSSGGLVNRGIDIGDDMVSGLIYKTYTLQDGPWSQQERNPEAPGRAAVPPWGKYDAALRTMIPFRPK 
PRFPAPQPLDNAGLFSYLTVSWLTPLMIQSLRSRLDENTI PPLSVHDASDKNVQRLHRLWEEEVSRRGIEKASVLLV 
MLRFQRTRLIFDALLGICFCIASVLGPILIIPKILEYSEEQLGNVVHGVGLCFALFLSECVKSLSFSSSWIINQRTA 
IRFRAAVSSFAFEKLIQFKSVIHITSGEAISFFTGDVNYLFEGVCYGPLVLITCASLVICSISSYFI IGYTAFIAIL 
CYPLVFPLEVFMTRMAVKAQHHTSEVSDQRIRVTSEVLTCIKLIKMYTWEKPFAKIIEDLRRKERKLLEKCGLVQSL 
TSITLFIIPAVATAVWVLIHTSLKLKLTASMAFSMLASLNLLRLSVFFVPIAVKGLTNSKSAVMRFKKFFLQESPVF 
YVQTLQDPSKALVFEEATLSWQQTCPGIVNGALELERNGHASEGMTRPRDALGPEEEGNSLGPELHKINLVVSKGMM 
LGVCGNTGSGKSSLLSAILEEMHLLEGSVGVQGSLAYVPQQAWIVSGNIRENILMGGAYDKARYLQVLHCCSLNRDL 
ELLPFGDMTEIGERGPNLSGGQKQRISLARAVYSDRQI YLLDDPLSAVDAHVGKHI FEECIKKTLRGKTVVQVTHQL 
QYLEFCGQVILLENGKICENGTHSELMQKKGKYAQLIQKMHKEATSDMLQDTAKIAEKPKVESQALATSLEESLNGN 
AVPEHQLTQEEEMEEGSLSWRVYHHYIQAAGGYMVSCIIFFFVVLIVFLTIFSFWWLSYWLEQGSGTNSSRESNGTM 
ADLGNIADNPQLSFYQLVYGLNALLLICVGVCSSGIFTKVTRKASTALHNKLFNKVFRCPMSFFDTIPIGRLLNCFA 
GDLEQLDQLLPIFSEQFLVLSLMVIAVLLIVSVLSPYILLMGAIIMVICFI YYMMFKEAIGVFKRLENYSRSPLFSH 
ILNSLQGLSSIHVYGKTEDFISQFKRLTDAQNNYLLLFLSSTRWMALRLEIMTNLVTLAVALFVAFGISSTPYSFKV 
MAVNIVLQLASSFQATARIGLETEAQFTAVERILQYMKMCVSEAPLHMEGTSCPQGWPQHGEIIFQDYHMKYRDNTP 
TVLHGINLTIRGHEVVGIVGRTGSGKSSLGMALFRLVEPMAGRILIDGVDICSIGLEDLRSKLSVIPQDPVLLSGTI 
RFNLDPFDRHTDQQIWDALERTFLTKAISKFPKKLHTDVVENGGNFSVGERQLLCIARAVLRNSKIILIDEATASID 
METDTLIQRTIREAFQGCTVLVIAHRVTTVLNCDRILVMGNGKVVEFDRPEVLRKKPGSLFAALMATATSSLR 



Transmembrane domains . 

amino acids 163-183, 199-219, 270-290, 300-320, 381-401, 418-438, 804-824, 858- 
878, 902-922, 935-955, 951-971, 990-1010, 1042-1062 
N-glycosylation sites. 

amino acids 11-14, 633-636, 713-716, 838-841, 844-847, 992-925, 1162-1165, 1277- 
1280 

cAMP- and cGMP-dependent protein kinase phosphorylation sites, 
amino acids 3-6, 889-892, 1026-1029 
N-myristoylation sites. 

amino acids 14-19, 20-25, 145-150, 170-175, 202-207, 204-209, 380-385, 440-445, 

489-494, 537-542, 541-546, 544-549, 547-552, 569-574, 834-839, 836-841, 877- 

882, 1160-1165, 1178-1183, 1203-1208, 1275-1280, 1367-1372 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 544-551, 1175-1182 

ABC transporter. 

amino acids 537-708, 1168-1351 

ABC transporter transmembrane region. 

amino acids 163-431, 806-1095 
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FIGURE 129 

MSVMVVRKKVTRKWEKLPGRNTFCCDGRVMMARQKGIFYLTLFLILGTCTLFFAFECRYLAVQLSPAIPVFAAMLFL 
FSMATLLRTSFSDPGVIPRALPDEAAFIEMEIEATNGAVPQGQRPPPRIKNFQINNQIVKLKYCYTCKIFRPPRASH 
CSICDNCVERFDHHCPWVGNCVGKRNYRYFYLFILSLSLLTIYVFAFNIVYVALKSLKIGFLETLKETPGTVLEVLI 
CFFTLWSVVGLTGFHTFLVALNQTTNEDIKGSWTGKNRVQNPYSHGNIVKNCCEVLCGPLPPSVLDRRGILPLEESG 
SRPPSTQETSSSLLPQSPAPTEHLNSNEMPEDSSTPEEMPPPEPPEPPQEAAEAEK 

Transmembrane domains. 

amino acids 32-52, 66-86, 184-204, 227-247 

N-glycosylation site, 
amino acids 253-256 

cAMP- and cGMP-dependent protein kinase phosphorylation site, 
amino acids 8-11 

N-myristoylation sites, 
amino acids 173-178, 262-267 

Amidation site, 
amino acids 176-179 

DHHC zinc finger domain, 
amino acids 130-194 
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FIGURE 130 

MVCGGFACSKNCLCALNLLYTLVSLLLIGIAAWGIGFGLISSLRVVGVVIAVGIFLFLIALVGLIGAVKHHQVLLFF 
YMIILLLVFIVQFSVSCACLALNQEQQGQLLEVGWNNTASARNDIQRNLNCCGFRSVNPNDTCLASCVKSDHSCSPC 
API IGE YAGEVLRFVGGI GLFFS FTE I LGVWLTYRYRNQKDPRAN PSAFL 

Transmembrane domains. 

amino acids 10-30, 43-63, 75-95, 163-183 

N-glycosylation sites, 
amino acids 113-116, 137-140 

N-myristoylation sites. 

amino acids 4-9, 34-39, 38-43, 47-52, 63-68, 173-178, 183-188 

Tetraspanin family homology, 
amino acids 10-194 



142/170 



FIGURE 131 

MKLGCVLMAWALYLSLGVLWVAQMLLAASFETLQCEGPVCTEESSCHTEDDLTDAREAGFQVKAYTFSEPFHLIVSY 
DWLILQGPAKPVFEGDLLVLRCQAWQDWPLTQVTFYRDGSALGPPGPNREFSITVVQKADSGHYHCSGIFQSPGPGI 
PETASVVAITVQELFPAPILRAVPSAEPQAGSPMTLSCQTKLPLQRSAARLLFSFYKDGRIVQSRGLSSEFQIPTAS 
EDHSGSYWCEAATEDNQVWKQSPQLEIRVQGASSSAAPPTLNPAPQKSAAPGTAPEEAPGPLPPPPTPSSEDPGFSS 
PLGMPDPHLYHQMGLLLKHMQDVRVLLGHLLMELRELSGHQKPGTTKATAE 

Signal sequence, 
amino acids 1-17 

N-myristoylation sites. 

amino acids 153-158, 185-190, 236-241, 262-267, 352-357 

Leucine zipper pattern, 
amino acids 12-33 
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FIGURE 132 

MPGGCSRGPAAGDGRLRLARLALVLLGWVSSSSPTSSASSFSSSAPFLASAVSAQPPLPDQCPALCECSEAARTVKC 
VNRNLTEVPTDLPAYVRNLFLTGNQLAVLPAGAFARRPPLAELAALNLSGSRLDEVRAGAFEHLPSLRQLDLSHNPL 
ADLSPFAFSGSNASVSAPSPLVELILNHIVPPEDERQNRSFEGMVVAALLAGRALQGLRRLELASNHFLYLPRDVLA 
QLPSLRHLDLSNNSLVSLTYVSFRNLTHLESLHLEDNALKVLHNGTLAELQGLPHIRVFLDNNPWVCDCHMADMVTW 
LKETEVVQGKDRLTCAYPEKMRNRVLLELNSADLDCDPILPPSLQTSYVFLGIVLALIGAIFLLVLYLNRKGIKKWM 
HNIRDACRDHMEGYHYRYEINADPRLTNLSSNSDV 

Signal sequence, 
amino acids 1-31 

Transmembrane domain, 
amino acids 355-375 

N-glycosylation sites. 

amino acids 81-84, 124-127, 166-169, 192-195, 243-246, 256-259, 275-278, 413-416 

Tyrosine kinase phosphorylation site, 
amino acids 393-401 

N-myristoylation sites. 

amino acids 100-105, 164-169, 197-199, 360-365 

Leucine zipper pattern, 
amino acids 82-103 

Leucine rich repeats. 

amino acids 92-115, 119-142, 143-166, 211-234, 235-258, 259-282 

Leucine rich repeat C- terminal domain, 
amino acids 294-345 

Leucine rich repeat N- terminal domain, 
amino acids 61-90 
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FIGURE 133 

MAPPQVLAFGLLLAAATATFAAAQEECVCENYKLAVNCFVNNNRQCQCTSVGAQNTVICSKLAAKCLVMKAEMNGSK 
LGRRAKPEGALQNNDGLYDPDCDESGLFKAKQCNGTSTCWCVNTAGVRRTDKDTEITCSERVRTYWIIIELKHKARE 
KPYDSKSLRTALQKEITTRYQLDPKFITSILYENNVITIDLVQNSSQKTQNDVDIADVAYYFEKDVKGESLFHSKKM 
DLTVNGEQLDLDPGQTLIYYVDEKAPEFSMQGLKAGVlAVIVVVVIAVVAGIVVLVISRKKRMAKYEKAEIKEMGEM 
HRELNA 

Signal sequence, 
amino acids 1-23 

Transmembrane domain . 
amino acids 265-285 

N-glycosylation sites. 

amino acids 74-77, 111-114, 198-201 

Tyrosine kinase phosphorylation site, 
amino acids 151-157 

N-myristoylation sites. 

amino acids 10-15, 52-57, 75-80, 86-91, 103-108, 112-117, 123-128, 263-268 

Amidation site, 
amino acids 7 8-81 

Thyroglobulin type-1 repeat, 
amino acids 66-135 
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FIGURE 134A 



MPCGFSPSPVAHHLVPGPPDTPAQQLRCGWTVGGWLLSLVRGLLPCLPPGARTAEGPIMVLAGPLAVSLLLPSLTLL 
VSHLSSSQDVSSEPSSEQQLCALSKHPTVAFEDLQPWVSNFTYPGARDFSQLALDPSGNQLIVGARNYLFRLSLANV 
SLLQATEWASSEDTRRSCQSKGKTEEECQNYVRVLIVAGRKVFMCGTNAFSPMCTSRQVGNLSRTIEKINGVARCPY 
DPRHNSTAVISSQGELYAATVIDFSGRDPAIYRSLGSGPPLRTAQYNSKWLNEPNFVAAYDIGLFAYFFLRENAVEH 
DCGRTVYSRVARVCKNDVGGRFLLEDTWTTFMKARLNCSRPGEVPFYYNELQSAFHLPEQDLIYGVFTTNVNSIAAS 
AVCAFNLSAISQAFNGPFRYQENPRAAWLPIANPIPNFQCGTLPETGPNENLTERSLQDAQRLFLMSEAVQPVTPEP 
CVTQDSVRFSHLVVDLVQAKDTLYHVLYIGTESGTILKALSTASRSLHGCYLEELHVLPPGRREPLRSLRILHSARA 
LFVGLRDGVLRVPLERCAAYRSQGACLGARDPYCGWDGKQQRCSTLEDSSNMSLWTQNITACPVRNVTRDGGFGPWS 
PWQPCEHLDGDNSGSCLCRARSCDSPRPRCGGLDCLGPAIHIANCSRNGAWTPWSSWALCSTSCGIGFQVRQRSCSN 
PAPRHGGRIFVGKSREERFCNENTPCPVPIFWASWGSWSKCSSNCGGGMQSRRRACENGNSCLGCGEFKTCNPEGCP 
EVRRNTPWTPWLPVNVTQGGARQEQRFRFTCRAPLADPHGLQFGRRRTETRTCPADGSGSCDTDALVEVLLRSGSTS 
PHTVSGGWAAWGPWSSCSRDCELGFRVRKRTCTNPEPRNGGLPCVGDAAEYQDCNPQACPVRGAWSCWTSWSPCSAS 
CGGGHYQRTRSCTSPAPSPGEDICLGLHTEEALCATQACPGWSPWSEWSKCTDDGAQSRSRHCEELLPGSSACAGNS 
SQSRPCPYSEI PVILPASSMEEATDCAGKRNRTYLMLRSSQPSSTPLQSLDSFHILLQTAKLCWGPHCFEMGSISST 
WWPRASPASWALGS 



Signal sequence, 
amino acids 1-42 
Transmembrane domain, 
amino acids 60-80 
N-glycosylation sites. 

amino acids 117-120, 153-156, 215-218, 236-239, 345-348, 391-394, 436-439, 590- 

593, 597-600, 605-608, 660-663, 785-788, 1000-1003, 1032-1035 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 773-776, 815-818, 875-878 

Tyrosine kinase phosphorylation sites. 

amino acids 177-185, 348-356 

N-myristoylation sites. 

amino acids 42-47, 50-55, 373-378, 492-497, 543-548, 563-568, 630-635, 647-652, 
740-745, 810-815, 827-832, 829-834, 853-858, 887-892, 910-915, 993-998, 1073- 
1078 

Amidation sites. 

amino acids 192-195, 522-525, 813-816, 1028-1031 
ATP/GTP-binding site motif A (P-loop) . 
amino acids 700-707 

Cytochrome c oxidase subunit II, copper A binding region signature, 
amino acids 921-929 

Growth factor and cytokines receptors family signature 2. 

amino acids 967-973 

Sema domain. 

amino acids 126-537 
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FIGURE 134B 



Thrombospondin type 1 domains . 

amino acids 613-661, 668-719, 726-769, 856-906, 913-963, 967-1007 

Plexin repeat. 

amino acids 555-602 

Plant PEC family metallothionein . 

amino acids 712-791 
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FIGURE 135 

MAKDNSTVRCFQGLLIFGNVIIGCCGIALTAECIFFVSDQHSLYPLLEATDNDDIYGAAWIGIFVGICLFCLSVLGI 
VGIMKSSRKILLAYFILMFIVYAFEVASCITAATQQDFFTPNLFLKQMLERYQNNSPPNNDDQWKNNGVTKTWDRLM 
LQDNCCGVNGPSDWQKYTSAFRTENNDADYPWPRQCCVMNNLKEPLNLEACKLGVPGFYHNQGCYELISGPMNRHAW 
GVAWFGFAILCWTFWVLLGTMFYWSRIEY 

Transmembrane domains. 

amino acids 12-32, 54-74, 89-109, 231-251 

N-glycosylation site, 
amino acids 5-8 

N-myristoylation sites. 

amino acids 26-31, 62-67, 79-84, 145-150 

Tetraspanin family homology, 
amino acids 12-258 
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FIGURE 136 

MFDKTRLPYVALDVLCVLLAGLPFAIFTSRHITSRHTPFQRGVFCNDESIKYPYKEDTIPYALLGGII IPFSIIVII 
LGETLSVYCNLLHSNSFIRNNYIATI YKAIGTFLFGAAASQSLTDIAKYSIGRLRPHFLDVCDPDWSKINCSDGYIE 
YYICRGNAERVKEGRLSFYSGHSSFSMYCMLFVALYLQARMKGDWARLLRPTLQFGLVAVSIYVGLSRVSDYKHHWS 
DVLTGLIQGALVAILVAVYVSDFFKERTSFKERKEEDSHTTLHETPTTGNHYPSNHQP 

Transmembrane domains. 

amino acids 7-27, 61-81, 97-117, 172-192, 200-220, 233-253 

N-glycosylation site, 
amino acids 147-150 

N-myristoylation sites. 

amino acids 21-26, 42-47, 113-118, 236-241, 240-245 

PAP2 superfamily. 
amino acids 105-256 
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FIGURE 137 

MLLWLLLLILTPGREQSGVAPKAVLLLNPPWSTAFKGEKVALICSSISHSLAQGDTYWYHDEKLLKIKHDKIQITEP 
GNYQCKTRGSSLSDAVHVEFSPDWLILQALHPVFEGDNVILRCQGKDNKNTHQKVYYKDGKQLPNSYNLEKITVNSV 
SRDNSKYHCTAYRKFYILDIEVTSKPLNIQVQELFLHPVLRASSSTPIEGSPMTLTCETQLSPQRPDVQLQFSLFRD 
SQTLGLGWSRSPRLQIPAMWTEDSGSYWCEVETVTHSIKKRSLRSQIRVQRVPVSNVNLEIRPTGGQLIEGENMVLI 
CSVAQGSGTVTFSWHKEGRVRSLGRKTQRSLLAELHVLTVKESDAGRYYCAADNVHSPILSTWIRVTVRIPVSHPVL 
TFRAPRAHTVVGDLLELHCESLRGSPPILYRFYHEDVTLGNSSAPSGGGASFNLSLTAEHSGNYSCDADNGLGAQHS 
HGVSLRVTVPVSRPVLTLRAPGAQAVVGDLLELHCESLRGSFPILYWFYHEDDTLGNISAHSGGGASFNLSLTTEHS 
GNYSCEADNGLGAQHSKVVTLNVTGTSRNRTGLTAAGITGLVLSILVLAAAAALLHYARARRKPGGLSATGTSSHSP 
SECQEPSSSRPSRIDPQEPTHSKPLAPMELEPMYSNVNPGDSNPIYSQIWSIQHTKENSANCPMMHQEHEELTVLYS 
ELKKTHPDDSAGEASSRGRAHEEDDEENYENVPRVLLASDH 

Signal sequence, 
amino acids 1-13 
Transmembrane domain, 
amino acids 574-594 
N-glycosylation sites. 

amino acids 426-429, 438-441, 448-451, 519-522, 531-534, 541-544, 561-561, 568- 
571 

Glycosaminoglycan attachment sites, 
amino acids 431-434, 524-527 

cAMP- and cGMP-dependent protein kinase phosphorylation site, 
amino acids 270-273 

Tyrosine kinase phosphorylation site, 
amino acids 34 9-356 
N-myristoylation sites. 

amino acids 78-83, 86-91, 204-209, 236-241, 256-261, 432-437, 434-439, 447-452, 
458-463, 518-523, 525-530, 527-532, 540-545, 551-556, 564-569, 571-576, 579-584, 
604-609, 605-610 
Amidation site, 
amino acids 331-334 

N-6 Adenine-specif ic DNA methylases signature, 
amino acids 25-31 
Immunoglobulin domains. 

amino acids 37-84, 113-165, 204-262, 302-360, 397-453, 490-546 
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FIGURE 138 

MEGGAAAATPTALPYYVAFSQLLGLTLVAMTGAWLGLYRGGIAWESDLQFNAHPLCMVIGLIFLQGNALLVYRVFRN 
EAKRTTKVLHGLLHIFALVIALVGLVAVFDYHRKKGYADLYSLHSWCGILVFVLYFVQWLVGFSFFLFPGASFSLRS 
RYRPQHIFFGATI FLLPVGTALLGLKEALLFNLGGKYSAFEPEGVLANVLGLLLACFGGAVLYILTRADWKRPSQAE 
EQALSMDFKTLRQGDSPGSQ 

Transmembrane domains . 

amino acids 13-33, 51-71, 85-105, 123-143, 161-181, 198-218 

cAMP- and cGMP-dependent protein kinase phosphorylation sites, 
amino acids 80-83, 225-228 

N-myristoylation sites. 

amino acids 3-8, 4-9, 32-37, 36-41, 147-152, 178-183, 188-193, 198-203, 205-210 

Cytochrome b561 homology, 
amino acids 1-238 
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FIGURE 139 

MRQLCRGRVLGISVAIAHGVFSGSLNILLKFLISRYQFSFLTLVQCLTSSTAALSLELLRRLGLIAVPPFGLSLARS 
FAGVAVLSTLQSSLTLWSLRGLSLPMYVVFKRCLPLVTMLIGVLVLKNGAPSPGVLAAVLITTCGAALAGAGDLTGD 
PIGYVTGVLAVLVHAAYLVLIQKASADTEHGPLTAQYVIAVSATPLLVICSFASTDSIHAWTFPGWKDPAMVCIFVA 
CILIGCAMNFTTLHCTYINSAVTTSLFIAGVVVNTLGSIIYCVAKFMETRKQSNYEDLEAQPRGEEAQLSGDQLPFV 
MEELPGEGGNGRSEGGEAAGGPAQESRQEVRGSPRGVPLVAGSSEEGSRRSLKDAYLEVWRLVRGTRYMKKDYLIEN 
EELPSP 

Transmembrane domains. 

amino acids 8-28, 51-71, 103-123, 119-139, 153-173, 189-209, 221-241, 253-273 

N-glycosylation site, 
amino acids 240-243 

cAMP- and cGMP-dependent protein kinase phosphorylation site, 
amino acids 281-284 

N-myristoylation sites. 

amino acids 11-16, 19-24, 71-76, 131-136, 142-147, 236-241, 261-266, 317-322, 
323-328, 340-345, 355-360 
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FIGURE 140 



MARLALSPVPSHWMVALLLLLSAEPVPAARSEDRYRNPKGSACSRIWQSPRFIARKRGFTVKMHCYMNSASGNVSWL 
WKQEMDENPQQLKLEKGRMEESQNESLATLTIQGIRFEDNGIYFCQQKCNNTSEVYQGCGTELRVMGFSTLAQLKQR 
NTLKDGIIMIQTLLIILFIIVPIFLLLDKDDSKAGMEEDHTYEGLDIDQTATYEDIVTLRTGEVKWSVGEHPGQE 

Signal sequence, 
amino acids 1-28 

Transmembrane domain . 
amino acids 159-179 

N-glycosylation sites. 

amino acids 73-76, 101-104, 127-130, 128-131 

Tyrosine kinase phosphorylation site, 
amino acids 113-120 

N-myristoylation sites, 
amino acids 40-45, 118-123 

Immunoglobulin domain, 
amino acids 58-124 

Immunoreceptor tyrosine-based activation motif, 
amino acids 193-213 
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FIGURE 141 

MGLPEPGPLRLLALLLLLLLLLLLRLQHLAAAAADPLLGGQGPAKECEKDQFQCRNERCIPSVWRCDEDDDCLDHSD 
EDDCPKKTCADSDFTCDNGHCIHERWKCDGEEECPDGSDESEATCTKQVCPAEKLSCGPTSHKCVPASWRCDGEKDC 
EGGADEAGCATSLGTCRGDEFQCGDGTCVLAIKHCNQEQDCPDGSDEAGCLQGLNECLHNNGGCSHICTDLKIGFEC 
TCPAGFQLLDQKTCGDIDECKDPDACSQICVNYKGYFKCECYPGCEMDLLTKNCKAAAGKSPSLIFTNRTSAEDRPV 
KRNYSRLIPMLKNWALDVEVATNRIYWCDLSYRKIYSAYMDKASDPKEREVLIDEQLHSPEGLAVDWVHKHIYWTD 
SGNKTISVATVDGGRRRTLFSRNLSEPRAIAVDPLRGFMYWSDWGDQAKIEKSGLNGVDRQTLVSDNIEWPNGITLD 
LLSQRLYWVDSKLHQLSSIDFSGGNRKTLISSTDFLSHPFGIAVFEDKVFWTDLENEAIFSANRLNGLEISILAENL 
NNPHDIVIFHELKQPRAPDACELSVQPNGGCEYLCLPAPQISSHSPKYTCACPDTMWLGPDMKRCYRDANEDSKMGS 
TVTAAVIGIIVPI VVIALLCMSGYLIWRNWKRKNTKSMNFDNPVYRKTTEEEDEDELHIGRTAQIGHVYPARVALSL 
EDDGLP 

Signal sequence, 
amino acids 1-26 
Transmembrane domain, 
amino acids 619-639 
N-glycosylation sites. 

amino acids 299-302, 311-314, 388-391, 408-411 

cAMP- and cGMP-dependent protein kinase phosphorylation sites, 
amino acids 400-403, 648-651, 662-665 
N-myristoylation sites. 

amino acids 40-45, 114-119, 157-162, 162-167, 168-173, 198-203, 203-208, 207- 
212, 486-491, 529-534, 615-620 
Amidation site, 
amino acids 398-401 

Aspartic acid and asparagine hydroxylation sites, 
amino acids 222-233, 261-272 
Cell attachment sequence, 
amino acids 171-173 

Low-density lipoprotein receptor repeats. 

amino acids 332-377, 379-420, 422-464, 466-509, 510-550 

Low-density lipoprotein receptor domains. 

amino acids 45-83, 84-124, 125-165, 168-206 
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FIGURE 142 

MPPLWALLALGCLRFGSAVNLQPQLASVTFATNNPTLTTVALEKPLCMFDSKEALTGTHEVYLYVLVDSAISRNASV 
QDSTNTPLGSTFLQTEGGRTGPYKAVAFDLIPCSDLPSLDAIGDVSKASQILNAYLVRVGANGTCLWDPN FQGLCNA 
PLSAATEYRFKYVLVNMSTGLVEDQTLWSDPIRTNQLTPYSTIDTWPGRRSGGMIVITSILGSLPFFLLVGFAGAIA 
LSLVDMGSSDGETTHDSQITQEAVPKSLGASESSYTSVNRGPPLDRAEVYSSKLQD 

Signal sequence, 
amino acids 1-18 

Transmembrane domain, 
amino acids 211-231 

N-glycosylation sites. 

amino acids 74-77, 139-142, 170-173 

N-myristoylation sites. 

amino acids 16-21, 137-142, 238-243, 260-265 

Amidation site, 
amino acids 201-204 
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FIGURE 143 

MRKLIAGLIFLKFWTYTVRASTDLPQTENCFQYIHQVTEISSTLPVALLRDEVPGWFLKVPEPQLISKELIMLTEVM 
EVWHGLVIAVVSLFLQACFLTAINYLLSRHMAHKSEQILKAAS LQVPRPSPGHHHPPAVKEMKETQTERDIPMSDSL 
YRHDSDTPSDSLDSSCSSPPACQATEDVDYTQVVFSDPGELK 

Signal sequence, 
amino acids 1-20 

Transmembrane domain, 
amino acids 75-95 

N-myristoylation site, 
amino acids 82-87 

Leucine zipper pattern, 
amino acids 83-104 
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FIGURE 144 

MAGTVRTACLVVAMLLSLDFPGQAQPPPPPPDATCHQVRSFFQRLQPGLKWVPETPVPGSDLQVCLPKGPTCCSRKM 
EEKYQLTARLNMEQLLQSASMELKFLIIQNAAVFQEAFEIVVRHAKNYTNAMFKNNYPSLTPQAFEFVGEFFTDVSL 
YILGSDINVDDMVNELFDSLFPVIYTQLMNPGLPDSALDINECLRGARRDLBCVFGNFPKLIMTQVSKSLQVTRIFLQ 
ALNLGIEVINTTDHLKFSKDCGRMLTRMWYCSYCQGLMMVKPCGGYCNVVMQGCMAGVVEIDKYWREYILSLEELVN 
GMYRIYDMENVLLGLFSTIHDSIQYVQKNAGKLTTTIGKLCAHSQQRQYRSAYYPEDLFIDKKVLKVAHVEHEETLS 
SRRRELIQKLKSFISFYSALPGYICSHSPVAENDTLCWNGQELMERYSQKAARNGMKNQFNLHELKMKGPEPVVSQI 
IDKLKHINQLLRTMSMPKGRVLDKNLDEEGFESGDCGDDEDECIGGSGDGMIKVKNQLRFLAELAYDLDVDDAPGNS 
QQATPKDNEISTFHNLGNVHSPLKLLTSMAISVVCFFFLVH 

Signal sequence, 
amino acids 1-24 

Transmembrane domain, 
amino acids 559-579 

N-glycosylation sites. 

amino acids 124-127, 241-244, 418-421 

Glycosaminoglycan attachment site, 
amino acids 509-512 

Tyrosine kinase phosphorylation sites, 
amino acids 75-81, 521-528 

N-myristoylation sites. 

amino acids 3-8, 158-163, 186-191, 275-280, 284-289, 322-327, 508-513 

Glypican homology, 
amino acids 4-578 
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FIGURE 145 

MGLPRGPLASLLLLQVCWLQCAASEPCRAVFREAEVTLEAGGAEQEPGQALGKVFMGCPGQEPALFSTDNDDFTVRN 
GETVQERRSLKERNPLKIFPSKRILRRHKRDWVVAPISVPENGKGPFPQRLNQLKSNKDRDTKIFYSITGPGADSPP 
EGVFAVEKETGWLLLNKPLDREEIAKYELFGHAVSENGASVEDPMNISIIVTDQNDHKPKFTQDTFRGSVLEGVLPG 
TSVMQVTATDEDDAIYTYNGWAYSIHSQEPKDPHDLMFTIHRSTGTISVISSGLDREKVPEYTLTIQATDMDGDGS 
TTTAVAWEILDANDNAPMFDPQKYEAHVPENAVGHEVQRLTVTDLDAPNSPAWRATYLIMGGDDGDHFTITTHPES 
NQGILTTRKGLDFEAKNQHTLYVEVTNEAPFVLKLPTSTATIVVHVEDVNEAPVFVPPSKVVEVQEGIPTGEPVCVY 
TAEDPDKENQKISYRILRDPAGWLAMDPDSGQVTAVGTLDREDEQFVRNNI YEVMVLAMDNGSPPTTGTGTLLLTLI 
DVNDHGPVPEPRQITICNQSPVRHVLNITDKDLSPHTSPFQAQLTDDSDIYWTAEVNEEGDTVVLSLKKFLKQDTYD 
VHLSLSDHGNKEQLTVIRATVCDCHGHVETCPGPWKGGFILPVLGAVLALLFLLLVLLLLVRKKRKIKEPLLLPEDD 
TRDNVFYYGEEGGGEEDQDYDITQLHRGLEARPEVVLRNDVAPTIIPTPMYRPRPANPDEIGNFIIENLKAANTDPT 
APPYDTLLVFDYEGSGSDAASLSSLTSSAS DQDQDYDYLNEWGSRFKKLADMYGGGEDD 

Signal sequence, 
amino acids 1-24 

Transmembrane domain, 
amino acids 655-675 

N-glycosylation sites, 
amino acids 200-203, 566-569 

N-myristoylation sites. 

amino acids 48-53, 60-65, 227-232, 307-312, 370-375, 388-393, 452-457, 493-498, 
524-529, 661-666, 786-791 

Cadherins extracellular repeated domain signature, 
amino acids 203-213, 316-326, 536-546 

Cadherin domains. 

amino acids 122-206, 220-319, 333-432, 445-539, 551-638 

Cadherin cytoplasmic region, 
amino acids 678-826 
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FIGURE 146 

GDCGDRGTARGTRREGTGIRSSGRAMDGNDNVTLLFAPLLRDNYTLAPNASSLGPGTDLALAPASSAGPGPGLSLGP 
GPSFGFSPGPTPTPEPTTSGLAGGAASHGPSPFPRPWAPHALPFWDTPLNHGLNVFVGAALCITMLGLGCTVDVNHF 
GAHVRRPVGALLAALCQFGLLPLLAFLLALAFKLDEVAAVAVLLCGCCPGGNLSNLMSLLVDGDMNLSIIMTISSTL 
LALVLMPLCLWIYSWAWINTPIVQLLPLGTVTLTLCSTLIPIGLGVFIRYKYSRVADYIVKVSLWSLLVTLVVLFIM 
TGTMLGPELLASIPAAVYVIAIFMPLAGYASGYGLATLFHLPPNCKRTVCLETGSQNVQLCTAILKLAFPPQFIGSM 
YMFPLLYALFQSAEAGIFVLIYKMYGSEMLHKRDPLDEDEDTDISYKKLKEEEMADTSYGTVKAENIIMMETAQTSL 

Transmembrane domains. 

amino acids 126-146, 161-181, 177-197, 218-238, 254-274, 289-309, 326-346 
N-glycosylation sites. 

amino acids 31-34, 43-46, 49-52, 206-209, 220-223 

Glycosaminoglycan attachment site, 
amino acids 339-342 

N-myristoylation sites. 

amino acids 7-12, 18-23, 56-61, 97-102, 100-105, 135-140, 144-149, 163-168, 200- 
205, 204-209, 205-210, 445-450 

Sodium Bile acid symporter family homology, 
amino acids 135-316 
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FIGURE 147 

MVRRDRLRRMREWWVQVGLLAVPLLAAYLHIPPPQLSPALHSWKSSGKFFTYKGLRIFYQDSVGVVGSPEIVVLLHG 
FPTSSYDWYKIWKGLTLRFHRVIALDFLGFGFSDKPRPHHYSIFEQASIVEALLRHLGLQNRRINLLSHDYGDIVAQ 
ELLYRYKQNRSGRHTIKSLCLSNGGIFPETHRPLLLQKLLKDGGVLSPILTRLMNFFVFSRGLTPVFGPYTRPSESE 
LWDMWAGIRNNDGNLVIDSLLQYINQRKKFRRRWVGALASVTIPIHFIYGPLDPVNPYPEFLELYRKTLPRSTVSIL 
DDHISHYPQLEDPMGFLNAYMGFINSF 

Transmembrane domains . 
amino acids 12-32, 59-79 

N-glycosylation site, 
amino acids 163-166 

N-myristoylation sites, 
amino acids 238-243, 267-272 

alpha/beta hydrolase fold, 
amino acids 97-332 
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FIGURE 148 



GWTSHLSNCGESNRPPKERSCFRVCDWHSDLFQWEVSDWHHCVLVPYARGEVKPRTAECVTAQHGLQHRMVRCIQKL 
NRTVVANEICEHFALQPPTEQACLIPCPRDCVVSEFLPWSNCSKGCGKKLQHRTRAVIAPPLFGGLQCPNLTESRAC 
DAPISCPLGEEEYTFSLKVGPWSKCRLPHLKEINPSGRTVLDFNSDSNERVTFKHQSYKAHHHSKSWAIEIGYQTRQ 
VSCTRSDGQNAMLSLCLQDSFPLTVQSCIMPKDCETSQWSSWSPCSKTCRSGSLLPGFRSRSRNVKHMAIGGGKECP 
ELLEKEACIVEGELLQQCPRYSWRTSEWKECQVSLLLEQQDPHWHVTGPVCGGGIQTREVYCAQSVPAAAALRAKEV 
SRPVEKALCVGPAPLPSQLCNIPCSTDCIVSSWSAWGLCIHENCHEPQGKKGFRTRQRHVLMESTGPAGHCPHLVES 
VPCEDPMCYRWLASEGICFPDHGKCGLGHRILKAVCQNDRGEDVSGSLCPVPPPPERKSCEI PCRMDCVLSEWTEWS 
SCSQSCSNKNSDGKQTRSRTILALAGEGGKPCPPSQALQEHRLCNDHSCMQLHWETSPWGPCSEDTLVTALNATIGW 
NGEATCGVGIQTRRVFCVKSHVGQVMTKRCPDSTRPETVRPCFLPCKKDCIVTAFSEWTPCPRMCQAGNATVKQSRY 
RIIIQEAANGGQECPDTLYEERECEDVSLCPVYRWKPQKWSPCILVPESVWQGITGSSEACGKGLQTRAVSCISDDN 
RSAEMMECLKQTNGMPLLVQECTVPCREDCTFTAWSKFTPCSTNCEATKSRRRQLTGKSRKKEKCQDSDLYPLVETE 
LCPCDEFISQPYGNWSDCILPEGRREPHRGLRVQADSKECGEGLRFRAVACSDKNGRPVDPSFCSSSGYIQEKCVIP 
CPFDCKLSDWSSWGSCSSSCGIGVRIRSBWLKEKPYNGGRPCPKLDLKNQAQVHEAVPCYSECNQYSWVVEHWSSCK 
INNELRSLRCGGGTQSRKIRCVNTADGEGGAVDSNLCNQDEIPPETQSCSLMCPNECVMSEWGLWSKCPQSCDPHTM 
QRRTRHLLRPSLNSRTCAEDSQVQPCLLNENCFQFQYNLTEWSTCQLSENAPCGQGVRTRLLSCVCSDGKPVSMDQC 
EQHNLEKPQRMSIPCLVECVVNCQLSGWTAWTECSQTCGHGGRMSRTRFIIMPTQGEGRPCPTELTQEKTCPVTPCY 
SWVLGNWSACKLEGGDCGEGVQIRSLSCMVHSGSISHAAGRVEDALCGEMPFQDSILKQLCSVPCPGDCHLTEWSEW 
STCELTCIDGRSFETVGRQSRSRTFIIQSFENQDSCPQQVLETRPCTGGKCYHYTWKASLWNNNERTVWCQRSDGVN 
VTGGCSPQARPAAIRQCIPACRKPFSYCTQGGVCGCEKGYTEIMRSNGFLDYCMKVPGSEDKKADVKNLSGKNRPVN 
SKIHDIFKGV3SLQPLDPDGRVKIWVYGVSGGAFLIMIFLIFTSYLVCKKPKPHQSTPPQQKPLTLAYDGDLDMN 



Transmembrane domain, 
amino acids 1486-1506 
N-glycosylation sites. 

amino acids 78-81, 118-121, 147-150, 611-614, 685-688, 770-773, 861-864, 1116- 
1119, 1238-1241, 1386-1389, 1454-1457 
Tyrosine kinase phosphorylation sites, 
amino acids 834-841, 953-960 
N-myristoylation sites. 

amino acids 361-366, 639-644, 746-751, 749-754, 938-943, 1013-1018, 1031-1036, 
1196-1201, 1237-1242, 1246-1251, 1384-1389, 1417-1422, 1418-1423, 1421-1426, 
1490-1495 
Amidation sites. 

amino acids 123-126, 433-436, 869-872 

Growth factor and cytokines receptors family signature 2. 
amino acids 268-274, 1061-1067, 1304-1310 
Thrombospondin type 1 domains. 

amino acids 111-160, 268-326, 416-470, 533-588, 593-658, 669-723, 803-851, 932- 
983, 1061-1110, 1181-1231, 1304-1360 
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FIGURE 149 

EKPVRKQTPPTTQIHCGPPKPVLSPSFKT PATPLGLSTSTGHMLMPLCGLLWWWWCCCSGWYCYGLCAPAPQMLRHQ 
GLLKCRCRMLFNDLKVFLLRRPPQAPLPMHGDPQPPGLAANNTLPALGAGGWAGWRGPREVVGREPPPVPPPPPLPP 
SSVEDDWGGPATEPPASLLSSASSDDFCKEKTEDRYSLGSSLDSGMRTPLCRICFQGPEQGELLSPCRCDGSVKCTH 
QPCLIKWISERGCWSCELCYYKYHVIAISTKNPLQWQAISLTVIEKVQVAAAILGSLFLIASISWLIWSTFSPSARW 
QRQDLLFQICYGMYGFMDVVCIGLIIHEGPSVYRIFKRWQAVNQQWKVLNYDKTKDLEDQKAGGRTNPRTSSSTQAN 
IPSSEEETAGTPAPEQGPAQAAGHPSGPLSHHHCAYTILHILSHLRPHEQRSPPGSSRELVMRVTTVN 

Transmembrane domains . 
amino acids 275-295, 316-336 

N-glycosylation site, 
amino acids 118-121 

cAMP- and cGMP-dependent protein kinase phosphorylation site, 
amino acids 5-8 

Tyrosine kinase phosphorylation site, 
amino acids 183-190 

N-myristoylation sites. 

amino acids 35-40, 78-83, 114-119, 127-132, 162-167, 225-230, 243-248 
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FIGURE 150 

QEQGDKMMEEYSLEKNERACIDFAISAKPLTRHMPQNKQSFQYRMWQFVVSPPFEYTIMAMIALNTIVLMMKFYGAS 
VAYENALRVFNIVFTSLFSLECVLKVMAFGILNYFRDAWNIFDFVTVLGSITDILVTEFGNNFINLSFLRLFRAARL 
IKLLRQGYTIRILLWTFVQSFKALPYVCLLIAMLFFIYAIIGMQVFGNIGIDVEDEDSDEDEFQITEHNNFRTFFQA 
LMLLFRSATGEAWHNIMLSCLSGKPCDKNSGILTRECGNEFAYFYFVSFI FLCSFLMLNLFVAVIMDNFEYLTRDSS 
ILGPHHLDEYVRVWAEYDPAACGRIHYKDMYSLLRVISPPLGLGKKCPHRVACKRLLRMDLPVADDNTVHFNSTLMA 
LIRTALDIKIAKGGADKQQMDAELRKEMMAIWPNLSQKTLDLLVTPHKSTDLTVGKIYAAMMIMEYYRQSKAKKLQA 
MREEQDRTPLMFQRMEPPSPTQEGGPGQNALPSTQLDPGGALMAHESGLKESPSWVTQRAQEMFQKTGTWSPEQGPP 
TDMPNSQPNSQSVEMREMGRDGYSDSEHYLPMEGQGRAASMPRLPAENQTISDTSPMKRSASVLGPKARRLDDYSLE 
RVPPEENQRHHQRRRDRSHRASERSLGRYTDVDTGLGTDLSMTTQSGDLPSKERDQERGRPKDRKHRQHHHHHHHHH 
HPPPPDKDRYAQERPDHGRARARDQRWSRSPSEGREHMAHRQGSSSVSGSPAPSTSGTSTPRRGRRQLPQTPSTPRP 
HVSYSPVIRKAGGSGPPQQQQQQQQQQQQAVARPGRAATSGPRRYPGPTAEPLAGDRPPTGGHSSGRSPRMERRVPG 
PARSES PRACRHGGARWPASGPHVSEGPPGPRHHGYYRGSDYDEADGPGSGGGEEAMAGAYDAPPPVRHASSGATGR 
SPRTPRASGPACASPSRHGRRLPNGYYPAHGLARPRGPGSRKGLHEPYSESDDDWCN 

Transmembrane domains. 

amino acids 49-69, 86-106, 114-134, 131-151, 167-187, 183-203, 222-242, 274-294 
N-glycosylation sites. 

amino acids 142-145, 380-383, 419-422, 587-590 

Glycosaminoglycan attachment site, 
amino acids 897-900 

N-myristoylation sites. 

amino acids 75-80, 269-274, 502-507, 573-578, 653-658, 831-836, 896-901, 899- 
904, 963-968 

Amidation sites. 

amino acids 351-354, 756-759, 942-945 

Ion transport protein homology, 
amino acids 83-296 
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FIGURE 151 

MRPVALLLLPSLLALLAHGLSLEAPTVGKGQAPGIEETDGELTAAPTPEQPERGVHFVTTAPTLKLLNHHPLLEEFL 
HEGLEKGDEELRPALSFQPDPPAPFTPSPLPRLANQDSRPVFTSPTPAMAAVPTQPQSKEGPWSPDPESESPMLRIT 
APLPPGPSMAVPTLGPGEIASTTPPSRAWTPTQEGPGDMGRPWVAEVVSQGAGIGIQGTITSSTASGDDEETTTTTT 
IITTTITTVQTPGPCSWNFSGPEGSLDSPTDLSSPTDVGLDCFFYISVYPGYGVEIKVKNISLREGETVTVEGLGGP 
DPLPLANQSFLLRGQVIRSPTHQAALRFQSLPPPAGPGTFHFHYQAYLLSCHFPRRPAYGDVTVTSLHPGGSARFHC 
ATGYQLKGARHLTCLNATQPFWDSKEPVCIGECPGVIRNATTGRIVSPGFPGNYSNNLTCHWLLEAPEGQRLHLHFE 
KVSLAEDDDRLIIRNGDNVEAPPVYDSYEVEYLPIEGLLSSGKHFFVELSTDSSGAAAGMALRYEAFQQGHCYEPFV 
KYGNFSSSTPTYPVGTTVEFSCDPGYTLEQGSIIIECVDPHDPQWNETEPACRAVCSGEITDSAGVVLSPNWPEPYG 
RGQDCIWGVHVEEDKRIMLDIRVLRIGPGDVLTFYDGDDLTARVLGQYSGPRSHFKLFTSMADVTIQFQSDPGTSVL 
GYQQGFVIHFFEVPRNDTCPELPEIPNGWKSPSQPELVHGTVVTYQCYPGYQVVGSSVLMCQWDLTWSEDLPSCQRV 
TSCHDPGDVEHSRRLISSPKFPVGATVQYICDQGFVLMGSSILTCHDRQAGSPKWSDRAPKCLLEQLKPCHGLSAPE 
NGARSPEKQLHPAGATIHFSCAPGYVLKGQASIKCVPGHPSHWSDPPPICRAASLDGSTTVAAWMVAKAPAASSTLD 
AAHIAAAIFLPLVAMVLLVGGVYFYFSRLQGKSSLQLPRPRPRPYNRITIESAFDNPTYETGSLSFAGDERI 

Signal sequence, 
amino acids 1-17 

Transmembrane domain, 
amino acids 928-948 

N-glycosylation sites. 

amino acids 249-252, 291-294, 315-318, 401-404, 424-427, 438-441, 442-445, 543- 
546, 585-588, 709-712 

N-myristoylation sites. 

amino acids 30-35, 34-39, 80-85, 205-210, 209-214, 212-217, 420-425, 437-442, 
499-504, 517-522, 542-547, 733-738 

Sushi domain (SCR repeat) . 

amino acids 359-414, 534-591, 712-767, 773-832, 840-897 
CUB domains. 

amino acids 418-526, 595-703 
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FIGURE 152 

MVQKSRNGGVYPGPSGEKKLKVGFVGLDPGAPDSTRDGALLIAGSEAPKRGSILSKPRAGGAGAGKPPKRNAFYRKL 
QNFLYNVLERPRGWAFIYHAYVFLLVFSCLVLSVFSTIKEYEKSSEGALYILEIVTIVVFGVEYFVRIWAAGCCCRY 
RGWRGRLKFARKPFCVIDIMVLIASIAVLAAGSQGNVFATSALRSLRFLQILRMIRMDRRGGTWKLLGSVVYAHSKE 
LVTAWYIGFLCLILASFLVYLAEKGENDHFDTYADALWWGLITLTTIGYGDKYPQTWNGRLLAATFTLIGVSFFALP 
AGILGSGFALKVQEQHRQKHFEKRRNPAAGLIQSAWRFYATNLSRTDLHSTWQYYERTVTVPMYRLIPPLNQLELLR 
NLKSKSGLAFRKDPPPEPSPSQKVSLKDRVFSSPRGVAAKGKGSPQAQTVRRSPSADQSLEDSPSKVPKSWSFGDRS 
RARQAFRIKGAASRQNSEEASLPGEDIVDDKSCPCEFVTEDLTPGLKVSIRAVCVMRFLVSKRKFKESLRPYDVMDV 
IEQYSAGHLDMLSRIKSLQSRVDQIVGRG PAITDKDRTKGPAEAELPEDPSMMGRLGKVEKQVLSMEKKLDFLVNIY 
MQRMGIPPTETEAYFGAKEPEPAPPYHSPEDSREHVDRHGCIVKIVRSSSSTGQKNFSAPPAAPPVQCPPSTSWQPQ 
SHPRQGHGTSPVGDHGSLVRIPPPPAHERSLSAYGGGNRASMEFLRQEDTPGCRPPEGTLRDSDTSISIPSVDHEEL 
ERSFSGFSISQSKENLDALNSCYAAVAPCAKVRPYIAEGESDTDSDLCTPCGPPPRSATGEGPFGDVGWAGPRK 

Transmembrane domains . 

amino acids 89-109, 127-147, 165-185, 184-204, 229-249, 295-315 

N-glycosylation sites, 
amino acids 350-353, 672-675 

cAMP- and cGMP-dependent protein kinase phosphorylation site, 
amino acids 49-52 

Tyrosine kinase phosphorylation site, 
amino acids 120-127 

N-myristoylation sites. 

amino acids 26-31, 30-35, 51-56, 60-65, 61-66, 186-191, 189-194, 310-315, 313- 
318, 338-343, 428-433, 507-512, 621-626, 729-734, 730-735 

Ion transport protein homology, 
amino acids 125-312 

KCNQ voltage-gated potassium channel homology, 
amino acids 417-653 
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FIGURE 153 

MVFAHRMDNSKPHLIIPTLLVPLQNRSCTETATPLPSQYLMELSEEHSWMSNQTDLHYVLKPGEVATASIFFGILWL 
FSIFGNSLVCLVIHRSRRTQSTTNYFVVSMACADLLISVASTPFVLLQFTTGRWTLGSATCKVVRYFQYLTPGVQIY 
VLLSICIDRFYTIVYPLSFKVSREKAKKMIAASWIFDAGFVTPVLFFYGSNWDSHCNYFLPSSWEGTAYTVIHFLVG 
FVIPSVLIILFYQKVIKYIWRIGT DGRTVRRTMNIVPRTKVKTIKMFLILNLLFLLSWLPFHVAQLWHPHEQDYKKS 
SLVFTAITWISFSSSASKPTLYSI YNANFRRGMKETFCMSSMKCYRSNAYTITTSSRMAKKNYVGISEIPSMAKTIT 
KDSIYDSFDREAKEKKLAWPINSNPPNTFV 

Transmembrane domains. 

amino acids 67-87, 105-125, 147-167, 184-204, 225-245, 273-293, 311-331 

N-glycosylation sites, 
amino acids 25-28, 52-55 

cAMP- and cGMP-dependent protein kinase phosphorylation site, 
amino acids 306-309 

N-myristoylation sites. 

amino acids 134-139, 220-225, 340-345 

7 transmembrane receptor (rhodopsin family) homology, 
amino acids 82-330 
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FIGURE 154 

MAVRPGLWPALLGIVLAAWLRGSGAQQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKC 
NGEWVRQVDHVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIAYLRKNFEQE 
PLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHSLVVRQARLADTANYTCVAKNIVARRR 
SASAAVIVYVDGSWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHTASGPEDVALYVG 
LIAVAVCLVLLLLVLILVYCRKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQ 
DGPSPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGIS 
LLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQS 
CEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLH 
DTHDALKEVVQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFT 
LERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPP 
CRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 

Signal sequence, 
amino acids 1-25 
Transmembrane domain, 
amino acids 304-324 
N-glycosylation sites. 

amino acids 107-110, 218-221, 287-290, 441-444, 682-685, 725-728, 816-819 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 229-232, 536-539 

Tyrosine kinase phosphorylation site. 

amino acids 212-219 

N-myristoylation sites. 

amino acids 6-11, 13-18, 24-29, 133-138, 175-180, 255-260, 274-279, 405-410, 
509-514, 717-722, 828-833 

Growth factor and cytokines receptors family signature 2. 

amino acids 243-249, 246-252 

ZU5 domain. 

amino acids 439-542 

Death domain. 

amino acids 761-841 
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FIGURE 155 



MAQALLVPPGPESFRLFTRESLAAIEKRAAEEKAKKPKKEQDNDDENKPKPNSDLEAGKNLPFIYGDIPPEiyiVSEPL 
EDLDPYYINKKTFIVMNKGKAIFRFSATSALYILTPLNPVRKIAIKILVHSLFSMLIMCTILTNCVFMTLSNPPDWT 
KNVEYTFTGI YTFESLIKILVRGFCLEDFTFLRDPWNWLDFSVIVMAYVTEFVSLGNVSALRTFRVLRALKTISVIP 
GLKTIVGALIQSVKKLSDVMILTVFCLSVFALIGLQLFMGNLRNKCLQWPPSDSAFETNTTSYFNGTMDSNGTFVNV 
TMSTFNWKDNIGDDSHFYVLDGQKDPLLCGNGSDAGQCPEGYICVKAGRNPNYGYTSFDTFSWAFLSLFRLMTQDYW 
ENLYQLTLRAAGKTYTIFFVLVIFLGSFYLVNLILAVVAMAYEEQNQATLEEAEQKEAEFQQMLEQLKKQQEEAQAV 
AAASAASRDFSGIGGLGELLESS SEAS KLSSKSAKEWRNRRKKRRQREHLEGNNKGERDSFPKSES EDS VKRSSFLF 
SMDGNRLTSDKKFCSPHQSLLSIRGSLFSPRRNSKTSIFSFRGRAKDVGSENDFADDEHSTFEDSETRRDSLFEPHR 
HGERRNSNGTTTETEVRKRRLSSYQISMEMLEDSSGRQRAVSIASILTNTMEELEESRQKCPPCWYRFANVFLIWDC 
CDAWLKVKHLVNLIVMDPFVDLAITICIVLNTLFMAMEHYPMTEQFSSVLTVGNLVFTGIFTAEMVLKIIAMDPYYY 
FQEGWNIFDGIIVSLSLMELGLSNVEGLSVLRSFRLLRVFKLAKSWPTLNMLIKI IGNSVGALGNLTLVLAII VFIF 
AVVGMQLFGKSYKECVCKINDDCTLPRWHMNDFFHSFLIVFRVLCGEWIETMWDCMEVAGQTMCLIVFMLVMVIGNL 
VVLNLFLALLLSSFSSDNLAATDDDNEMNNLQIAVGRMQKGIDYVKNKMRECFQKAFFRKPKVIEIHEGNKIDSCMS 
NNTGIEISKALNYLRDGNGTTSGVGTGSSVEKYVIDENDYMSFINNPSLTVTVPIAVGESDFENLNTEEFSSESELE 
ESKEKLNATSSSEGSTVDVVLPREGEQAETEPEEDLKPEACFTEGCIKKFPFCQVSTEEGKGKIWWNLRKTCYSIVE 
HNWFETFIVFMILLSSGALAFEDIYIEQRKTIKTMLEYADKVFTYIFILEMLLKWVAYGFQTYFTNAWCRLDFLIVD 
VSLVSLVANALGYSELGAIKSLRTLRALRPLRALSRFEGMRVVVNALVGAIPSIMNVLLVCLI FWLIFSIMGVNLFA 
GKFYHCVNMTTGNMFDISDVNNLSDCQALGKQARWKNVKVNFDNVGAGYLALLQVVS 



Transmembrane domains. 

amino acids 124-144, 249-269, 402-422, 707-727, 740-760, 778-798, 821-841, 
837-857, 874-894, 914-934, 1155-1175, 1192-1212, 1224-1244, 1240-1260, 
1269-1289, 1285-1305 
N-glycosylation sites. 

amino acids 211-214, 290-293, 296-299, 302-305, 307-310, 339-342, 624-627, 
835-838, 1002-1005, 1019-1022, 1085-1088, 1317-1320, 1331-1334 
Glycosaminoglycan attachment sites, 
amino acids 473-476, 1023-1026 

cAMP- and cGMP-dependent protein kinase phosphorylation sites, 
amino acids 245-248, 533-536, 570-573, 607-610, 620-623, 635-638 
Tyrosine kinase phosphorylation site, 
amino acids 1033-1041 
N-myristoylation sites . 

amino acids 210-215, 271-276, 297-302, 303-308, 340-345, 514-519, 543-548, 
752-757, 780-785, 827-832, 831-836, 907-912, 1005-1010, 1018-1023, 1020-1025, 
1024-1029, 1026-1031, 1249-1254, 1281-1286 
ATP/GTP-binding site motif A (P-loop) . 
amino acids 851-858 
Ion transport protein. 

amino acids 153-423, 742-934, 1190-1365 
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FIGURE 156 



MAQALLVPPGPESFRLFTRESLAAIEKRAAEEKAKKPKKEQDNDDENKPKPNSDLEAGKNLPFIYGDIPPEMVSEPL 
EDLDPYYINKKTFIVMNKGKAIFRFSATSALYILTPLNPVRKIAIKILVHSLFSMLIMCTILTNCVFMTLSNPPDWT 
KNVEYTFTGIYTFESLIKILARGFCLEDFTFLRDPWNWLDFSVIVMAYVTEFVDLGNVSALRTFRVLRALKTISVIP 
GLKTIVGALIQSVKKLSDVMILTVFCLSVFALIGLQLFMGNLRNKCLQWPPSDSAFETNTTS YFNGTMDSNGTFVNV 
TMSTFNWKDYIGDDSHFYVLDGQKDPLLCGNGSDAGQCPEGYICVKAGRNPNYGYTSFDTFSWAFLSLFRLMTQDYW 
ENL YQLT LRAAGKT YM I FFVLV I FLGS F YLVNL I LAVVAMAYEEQNQAT LEEAEQKEAE FQQMLEQLKKQQEEAQAV 
AAASAASRDFSGVGGLGELLESSSEASKLSSKGAKEWRNRRKKRRQREHLEGNNKGERDSFPKSESEDSVKRSSFLF 
SMDGNRLTSDKKFCSPHQSLLSIRGSLFSPRRNSKTSIFSFRGRAKDVGSENDFADDEHSTFEDSESRRDSLFVPHR 
HGERRNSNGTTTETEVRKRRLSSYQISMEMLEDSSGRQRAVSIASILTNTMEELEESRQKCPPCWYRFANVFLIWDC 
CDAWLKVKHLVNLIVMDPFVDLAITICIVLNTLFMAMEHYPMTEQFSSVLTVGNLVFTGIFTAEMVLKIIAMDPYYY 
FQEGWNIFDGI IVSLSLMELGLSNVEGLSVLRSFRLLRVFKLAKSWPTLNMLIKIIGNSVGALGNLTLVLAIIVFIF 
AVVGMQLFGKSYKECVCKINDDCTLPRWHMNDFFHSFLIVFRVLCGEWIETMWDCMEVAGQTMCLIVFMLVMVIGNL 
WLNLFLALLLSSFSSDNLAATDDDNEMNNLQIAVGRMQKGIDYVKNKMRECFQKAFFRKPKVIEIHEGNKIDSCMS 
NNTGIEISKELNYLRDGNGTTSGVGTGSSVEKYVIDENDYMSFINNPSLTVTVPIAVGESDFENLNTEEFSSESELE 
ESKEKLNATSSSEGSTVDVVLPREGEQAETEPEEDFKPEACFTEGCIKKFPFCQVSTEEGKGKIWWNLRKTCYSIVE 
HNWFETFIVFMILLSSGALAFEDIYIEQRKTIKTMLEYADKVFTYI FILEMLLKWVAYGFQTYFTNAWCWLDFLIVD 
VSLVSLVANALGYSELGAIKSLRTLRALRPLRALSRFEGMRWVNALVGAIPSIMNVLLVCLIFWLIFSIMGVNLFA 
GKFYHCVNMTTGNMFDISDVNNLSDCQALGKQARWKNVKVNFDNVGAGYLALLQVATFKGWMDIMYAAVDSRDVKLQ 
PVYEENLYMYLYFVI FI IFGSFFTLNLFIGVIIDNFNQQKKKFGGQDIFMTEEQKKYYNAMKKLGSKKPQKPIPRPA 
NKFQGMVFDFVTRQVFDISIMILICLNMVTMMVETDDQGKYMTLVLSRINLVFIVLFTGEFVLKLVSLRHYYFTIGW 
NIFDFVVVILSIVGMFLAEMIEKYSVSPTLFRVIRLARIGRILRLIKGAKGIRTLLFALMMSLPALFNIGLLLFLVM 
FIYAIFGMSNFAYVKKEAGIDDMFNFETFGNSMICLFQITTSAGWDGLLAPILNSAPPDCDPDTIHPGSSVKGDRGD 
PSVGIFFFVSYIIISFLVVVNMYIAVILENFSVATEESAEPLSEDDFEMFYEVWEKFDPDATQFIEFSKLSDFAAAL 
DPPLLIAKPNKVQLIAMDLPMVSGDRIHCLDILFAFTKRVLCESGEMDALRIQMEDRFMASNPSKVSYEPITTTLKR 
KQEEVSAAIIQRNFRCYLLKQRLKNISSNYNKEAIKGRIDLPIKQDMIIDKLNGNSTPEKTDGSSSTTSPPSYDSVT 
KPDKEKFEKDKPEKESKGKEVRENQK 

Transmembrane domains. 

amino acids 124-144, 249-269, 402-422, 707-727, 740-760, 778-798, 821-841, 837- 
857, 874-894, 914-934, 1155-1175, 1191-1211, 1224-1244, 1240-1260, 1269-1289, 
1285-1305, 1351-1371, 1396-1416, 1472-1492, 1507-1527, 1538-1558, 1556-1576, 
1590-1610, 1606-1626, 1648-1668, 1700-1720 
N-glycosylation sites. 

amino acids 211-214, 290-293, 296-299, 302-305, 307-310, 339-342, 624-627, 835- 

838, 1002-1105, 1019-1022, 1085-1088, 1317-1320, 1331-1334, 1724-1727, 1873-1876 

Glycosaminoglycan attachment sites. 

amino acids 473-476, 1023-1026 

Tyrosine kinase phosphorylation sites. 

amino acids 1033-1041, 1368-1375 

N-myristoylation sites. 

amino acids 210-215, 271-276, 297-302, 303-308, 340-345, 514-519, 543-548, 752- 
757, 780-785, 827-832, 831-836, 907-912, 1005-1010, 1018-1023, 1020-1025, 1024- 
1029, 1026-1031, 1249-1254, 1281-1286, 1406-1411, 1554-1559, 1911-1916 
Cell attachment sequence, 
amino acids 1692-1694 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 851-858 

Ion transport protein homology 

amino acids 153-423, 742-934, 1190-1418, 1508-1721 

Calmodulin-binding motif. 

amino acids 1852-1872 

ATP synthase homology. 

amino acids 399-565 

ABC-2 type transporter homology. 

amino acids 1421-1636 

Alg9-like mannosyltransf erase family homology, 
amino acids 1223-1667 
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FIGURE 157 

MGAAAAQAPLGLPAASARLLLLATSVLLLFAFSLPGSRASNQPPGGGGGTGGDCPGGKGKSINCSELNVRESDVRVC 
DESSCKYGGVCKEDGDGLKCACQFQCHTNYIPVCGSNGDTYQNECFLRRAACKHQKEITVIARGPCYSDNGSGSGEG 
EEEGSGAEVHRKHSKCGPCKYKAECDEDAENVGCVCNIDCSGYSFNPVCASDGSSYNNPCFVREASCIKQEQIDIRH 
LGHCTDTDDTSLLGKKDDGLQYRPDVKDASDQREDVYIGNHMPCPENLNGYCIHGKCEFI YLLRRASCRCESGYTGQ 
HCEKTDFSILYWPSRQKLTHVLIAAIIGAVQIAIIVAIVMCITRKCPKNNRGRRQKQNLGHFTSDTSSRMV 

Signal sequence, 
amino acids 1-39 
Transmembrane domain, 
amino acids 329-34 9 
N-glycosylation sites, 
amino acids 63-66, 147-150 
Glycosaminoglycan attachment sites, 
amino acids 149-152, 151-154 

cAMP- and cGMP-dependent protein kinase phosphorylation sites, 
amino acids 165-168, 295-298 
Tyrosine kinase phosphorylation site, 
amino acids 24 6-253 
N-myristoylation sites. 

amino acids 2-7, 11-16, 36-41, 45-50, 46-51, 47-52, 48-53, 94-99, 148-153, 

150-155, 187-192, 207-212 

Amidation sites. 

amino acids 244-247, 360-363 

EGF-like domain cysteine pattern signature. 

amino acids 299-310 

Kazal-type serine protease inhibitor domain, 
amino acids 99-143, 190-235 
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FIGURE 158 

MLPEQLYFLQSPPEEEPEYHPDASAQELNVRESDVRVCDESSCKYGGVCKEDGDGLKCACQFQCHTNYIPVCGSNGD 
TYQNECFLRRAACKHQKEITVIARGPCYSDNGSGSGEGEEEGSGAEVHRKHSKCGPCKYKAECDEDAENVGCVCNID 
CSGYSFNPVCASDGSSYNNPCFVREASCIKQEQIDIRHLGHCTDTDDTSLLGKKDDGLQYRPDVKDASDQREDVYIG 
NHMPCPENLNGYCIHGKCEFIYSTQKASCRCESGYTGQHCEKTDFSILYVVPSRQKLTHVLIAAIIGAVQIAIIVAI 
VMCITRKCPKNNRGRRQKQNLGHFTSDTSSRMV 

Transmembrane domain, 
amino acids 290-310 

N-glycosylation site, 
amino acids 108-111 

Glycosaminoglycan attachment site, 
amino acids 110-113, 112-115 

cAMP- and cGMP-dependent protein kinase phosphorylation site, 
amino acids 126-129 

Tyrosine kinase phosphorylation site, 
amino acids 207-214 

N-myristoylation sites. 

amino acids 55-60, 109-114, 111-116, 148-153, 168-173 

Amidation sites. 

amino acids 205-208, 321-324 

EGF-like domain cysteine pattern signature, 
amino acids 260-271 

Kazal-type serine protease inhibitor domains, 
amino acids 60-104, 151-196 

EGF-like domains. 

amino acids 38-64, 236-271 



